Overview

Dataset statistics

Number of variables183
Number of observations1765
Missing cells10849
Missing cells (%)3.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 MiB
Average record size in memory1.4 KiB

Variable types

Categorical181
Numeric2

Alerts

('P0', 'id') has a high cardinality: 1765 distinct values High cardinality
('P35', 'other') has a high cardinality: 70 distinct values High cardinality
('P1', 'age') is highly correlated with salarioHigh correlation
('P19', 'is_data_science_professional') is highly correlated with ('P20', 'linear_regression') and 11 other fieldsHigh correlation
('P20', 'linear_regression') is highly correlated with ('P19', 'is_data_science_professional') and 8 other fieldsHigh correlation
('P20', 'logistic_regression') is highly correlated with ('P19', 'is_data_science_professional') and 6 other fieldsHigh correlation
('P20', 'decision_tree') is highly correlated with ('P19', 'is_data_science_professional') and 7 other fieldsHigh correlation
('P20', 'random_forest') is highly correlated with ('P20', 'linear_regression') and 7 other fieldsHigh correlation
('P20', 'neural_networks') is highly correlated with ('P20', 'cnns') and 1 other fieldsHigh correlation
('P20', 'ensemble') is highly correlated with ('P20', 'random_forest') and 2 other fieldsHigh correlation
('P20', 'svms') is highly correlated with ('P20', 'random_forest') and 1 other fieldsHigh correlation
('P20', 'cnns') is highly correlated with ('P20', 'neural_networks') and 2 other fieldsHigh correlation
('P20', 'rnns') is highly correlated with ('P20', 'neural_networks') and 1 other fieldsHigh correlation
('P20', 'nlp') is highly correlated with ('P23', 'nlp')High correlation
('P20', 'gradient_boosted_machines') is highly correlated with ('P20', 'random_forest') and 1 other fieldsHigh correlation
('P20', 'cluster_analysis') is highly correlated with ('P20', 'linear_regression') and 3 other fieldsHigh correlation
('P21', 'sql_') is highly correlated with ('P19', 'is_data_science_professional') and 5 other fieldsHigh correlation
('P21', 'python') is highly correlated with ('P19', 'is_data_science_professional') and 10 other fieldsHigh correlation
('P23', 'sql') is highly correlated with ('P19', 'is_data_science_professional') and 8 other fieldsHigh correlation
('P23', 'nosql') is highly correlated with ('P19', 'is_data_science_professional') and 3 other fieldsHigh correlation
('P23', 'images') is highly correlated with ('P20', 'cnns')High correlation
('P23', 'nlp') is highly correlated with ('P20', 'nlp')High correlation
('P23', 'sheets') is highly correlated with ('P19', 'is_data_science_professional') and 7 other fieldsHigh correlation
('P24', 'sql') is highly correlated with ('P19', 'is_data_science_professional') and 4 other fieldsHigh correlation
('P24', 'nosql') is highly correlated with ('P23', 'nosql')High correlation
('P24', 'planilhas') is highly correlated with ('P19', 'is_data_science_professional') and 1 other fieldsHigh correlation
('P25', 'aws') is highly correlated with ('P19', 'is_data_science_professional') and 3 other fieldsHigh correlation
('P26', 'mongodb') is highly correlated with ('P23', 'nosql')High correlation
('P26', 's3') is highly correlated with ('P25', 'aws')High correlation
('P27', 'microsoft_powerbi') is highly correlated with ('P19', 'is_data_science_professional')High correlation
('P31', 'data_hackers_blog') is highly correlated with ('P31', 'do_not_know_data_hackers')High correlation
('P31', 'data_hackers_podcast') is highly correlated with ('P31', 'do_not_know_data_hackers')High correlation
('P31', 'do_not_know_data_hackers') is highly correlated with ('P31', 'data_hackers_blog') and 1 other fieldsHigh correlation
salario is highly correlated with ('P1', 'age')High correlation
('P19', 'is_data_science_professional') is highly correlated with ('P20', 'linear_regression') and 11 other fieldsHigh correlation
('P20', 'linear_regression') is highly correlated with ('P19', 'is_data_science_professional') and 8 other fieldsHigh correlation
('P20', 'logistic_regression') is highly correlated with ('P19', 'is_data_science_professional') and 6 other fieldsHigh correlation
('P20', 'decision_tree') is highly correlated with ('P19', 'is_data_science_professional') and 7 other fieldsHigh correlation
('P20', 'random_forest') is highly correlated with ('P20', 'linear_regression') and 7 other fieldsHigh correlation
('P20', 'neural_networks') is highly correlated with ('P20', 'cnns') and 1 other fieldsHigh correlation
('P20', 'ensemble') is highly correlated with ('P20', 'random_forest') and 2 other fieldsHigh correlation
('P20', 'svms') is highly correlated with ('P20', 'random_forest') and 1 other fieldsHigh correlation
('P20', 'cnns') is highly correlated with ('P20', 'neural_networks') and 2 other fieldsHigh correlation
('P20', 'rnns') is highly correlated with ('P20', 'neural_networks') and 1 other fieldsHigh correlation
('P20', 'nlp') is highly correlated with ('P23', 'nlp')High correlation
('P20', 'gradient_boosted_machines') is highly correlated with ('P20', 'random_forest') and 1 other fieldsHigh correlation
('P20', 'cluster_analysis') is highly correlated with ('P20', 'linear_regression') and 3 other fieldsHigh correlation
('P21', 'sql_') is highly correlated with ('P19', 'is_data_science_professional') and 5 other fieldsHigh correlation
('P21', 'python') is highly correlated with ('P19', 'is_data_science_professional') and 10 other fieldsHigh correlation
('P23', 'sql') is highly correlated with ('P19', 'is_data_science_professional') and 8 other fieldsHigh correlation
('P23', 'nosql') is highly correlated with ('P19', 'is_data_science_professional') and 3 other fieldsHigh correlation
('P23', 'images') is highly correlated with ('P20', 'cnns')High correlation
('P23', 'nlp') is highly correlated with ('P20', 'nlp')High correlation
('P23', 'sheets') is highly correlated with ('P19', 'is_data_science_professional') and 7 other fieldsHigh correlation
('P24', 'sql') is highly correlated with ('P19', 'is_data_science_professional') and 4 other fieldsHigh correlation
('P24', 'nosql') is highly correlated with ('P23', 'nosql')High correlation
('P24', 'planilhas') is highly correlated with ('P19', 'is_data_science_professional') and 1 other fieldsHigh correlation
('P25', 'aws') is highly correlated with ('P19', 'is_data_science_professional') and 3 other fieldsHigh correlation
('P26', 'mongodb') is highly correlated with ('P23', 'nosql')High correlation
('P26', 's3') is highly correlated with ('P25', 'aws')High correlation
('P27', 'microsoft_powerbi') is highly correlated with ('P19', 'is_data_science_professional')High correlation
('P31', 'data_hackers_blog') is highly correlated with ('P31', 'do_not_know_data_hackers')High correlation
('P31', 'data_hackers_podcast') is highly correlated with ('P31', 'do_not_know_data_hackers')High correlation
('P31', 'do_not_know_data_hackers') is highly correlated with ('P31', 'data_hackers_blog') and 1 other fieldsHigh correlation
('P19', 'is_data_science_professional') is highly correlated with ('P20', 'linear_regression') and 11 other fieldsHigh correlation
('P20', 'linear_regression') is highly correlated with ('P19', 'is_data_science_professional') and 8 other fieldsHigh correlation
('P20', 'logistic_regression') is highly correlated with ('P19', 'is_data_science_professional') and 6 other fieldsHigh correlation
('P20', 'decision_tree') is highly correlated with ('P19', 'is_data_science_professional') and 7 other fieldsHigh correlation
('P20', 'random_forest') is highly correlated with ('P20', 'linear_regression') and 7 other fieldsHigh correlation
('P20', 'neural_networks') is highly correlated with ('P20', 'cnns') and 1 other fieldsHigh correlation
('P20', 'ensemble') is highly correlated with ('P20', 'random_forest') and 2 other fieldsHigh correlation
('P20', 'svms') is highly correlated with ('P20', 'random_forest') and 1 other fieldsHigh correlation
('P20', 'cnns') is highly correlated with ('P20', 'neural_networks') and 2 other fieldsHigh correlation
('P20', 'rnns') is highly correlated with ('P20', 'neural_networks') and 1 other fieldsHigh correlation
('P20', 'nlp') is highly correlated with ('P23', 'nlp')High correlation
('P20', 'gradient_boosted_machines') is highly correlated with ('P20', 'random_forest') and 1 other fieldsHigh correlation
('P20', 'cluster_analysis') is highly correlated with ('P20', 'linear_regression') and 3 other fieldsHigh correlation
('P21', 'sql_') is highly correlated with ('P19', 'is_data_science_professional') and 5 other fieldsHigh correlation
('P21', 'python') is highly correlated with ('P19', 'is_data_science_professional') and 10 other fieldsHigh correlation
('P23', 'sql') is highly correlated with ('P19', 'is_data_science_professional') and 8 other fieldsHigh correlation
('P23', 'nosql') is highly correlated with ('P19', 'is_data_science_professional') and 3 other fieldsHigh correlation
('P23', 'images') is highly correlated with ('P20', 'cnns')High correlation
('P23', 'nlp') is highly correlated with ('P20', 'nlp')High correlation
('P23', 'sheets') is highly correlated with ('P19', 'is_data_science_professional') and 7 other fieldsHigh correlation
('P24', 'sql') is highly correlated with ('P19', 'is_data_science_professional') and 4 other fieldsHigh correlation
('P24', 'nosql') is highly correlated with ('P23', 'nosql')High correlation
('P24', 'planilhas') is highly correlated with ('P19', 'is_data_science_professional') and 1 other fieldsHigh correlation
('P25', 'aws') is highly correlated with ('P19', 'is_data_science_professional') and 3 other fieldsHigh correlation
('P26', 'mongodb') is highly correlated with ('P23', 'nosql')High correlation
('P26', 's3') is highly correlated with ('P25', 'aws')High correlation
('P27', 'microsoft_powerbi') is highly correlated with ('P19', 'is_data_science_professional')High correlation
('P31', 'data_hackers_blog') is highly correlated with ('P31', 'do_not_know_data_hackers')High correlation
('P31', 'data_hackers_podcast') is highly correlated with ('P31', 'do_not_know_data_hackers')High correlation
('P31', 'do_not_know_data_hackers') is highly correlated with ('P31', 'data_hackers_blog') and 1 other fieldsHigh correlation
('P1', 'age') is highly correlated with ('P8', 'degreee_level') and 2 other fieldsHigh correlation
('P2', 'gender') is highly correlated with sexoHigh correlation
('P5', 'living_state') is highly correlated with ('D1', 'living_macroregion')High correlation
('P8', 'degreee_level') is highly correlated with ('P1', 'age') and 2 other fieldsHigh correlation
('P10', 'job_situation') is highly correlated with ('P16', 'salary_range') and 3 other fieldsHigh correlation
('P12', 'workers_number') is highly correlated with ('P29', 'have_data_warehouse') and 2 other fieldsHigh correlation
('P13', 'manager') is highly correlated with gestorHigh correlation
('P16', 'salary_range') is highly correlated with ('P10', 'job_situation') and 4 other fieldsHigh correlation
('P17', 'time_experience_data_science') is highly correlated with experiencia_dsHigh correlation
('P18', 'time_experience_before') is highly correlated with ('P35', 'other')High correlation
('P19', 'is_data_science_professional') is highly correlated with ('P20', 'linear_regression') and 31 other fieldsHigh correlation
('P20', 'linear_regression') is highly correlated with ('P19', 'is_data_science_professional') and 27 other fieldsHigh correlation
('P20', 'logistic_regression') is highly correlated with ('P19', 'is_data_science_professional') and 23 other fieldsHigh correlation
('P20', 'glms') is highly correlated with ('P20', 'linear_regression') and 9 other fieldsHigh correlation
('P20', 'decision_tree') is highly correlated with ('P19', 'is_data_science_professional') and 24 other fieldsHigh correlation
('P20', 'random_forest') is highly correlated with ('P19', 'is_data_science_professional') and 23 other fieldsHigh correlation
('P20', 'neural_networks') is highly correlated with ('P19', 'is_data_science_professional') and 19 other fieldsHigh correlation
('P20', 'bayesian_inference') is highly correlated with ('P19', 'is_data_science_professional') and 12 other fieldsHigh correlation
('P20', 'ensemble') is highly correlated with ('P20', 'linear_regression') and 14 other fieldsHigh correlation
('P20', 'svms') is highly correlated with ('P20', 'linear_regression') and 13 other fieldsHigh correlation
('P20', 'cnns') is highly correlated with ('P20', 'neural_networks') and 7 other fieldsHigh correlation
('P20', 'rnns') is highly correlated with ('P20', 'random_forest') and 8 other fieldsHigh correlation
('P20', 'hmms') is highly correlated with ('P20', 'markov_chains')High correlation
('P20', 'gans') is highly correlated with ('P20', 'cnns') and 1 other fieldsHigh correlation
('P20', 'markov_chains') is highly correlated with ('P20', 'bayesian_inference') and 2 other fieldsHigh correlation
('P20', 'nlp') is highly correlated with ('P19', 'is_data_science_professional') and 18 other fieldsHigh correlation
('P20', 'gradient_boosted_machines') is highly correlated with ('P20', 'linear_regression') and 13 other fieldsHigh correlation
('P20', 'cluster_analysis') is highly correlated with ('P19', 'is_data_science_professional') and 21 other fieldsHigh correlation
('P20', 'joint analysis') is highly correlated with ('P35', 'other')High correlation
('P20', 'no_listed_methods') is highly correlated with ('P19', 'is_data_science_professional') and 2 other fieldsHigh correlation
('P21', 'sql_') is highly correlated with ('P19', 'is_data_science_professional') and 24 other fieldsHigh correlation
('P21', 'r') is highly correlated with ('P19', 'is_data_science_professional') and 13 other fieldsHigh correlation
('P21', 'python') is highly correlated with ('P19', 'is_data_science_professional') and 31 other fieldsHigh correlation
('P21', 'c_c++_c#') is highly correlated with ('P21', 'dotnet')High correlation
('P21', 'dotnet') is highly correlated with ('P21', 'c_c++_c#')High correlation
('P21', 'sas_stata') is highly correlated with ('P22', 'most_used_proggraming_languages') and 1 other fieldsHigh correlation
('P21', 'matlab') is highly correlated with ('P35', 'other')High correlation
('P21', 'no_listed_languages') is highly correlated with ('P22', 'most_used_proggraming_languages')High correlation
('P22', 'most_used_proggraming_languages') is highly correlated with ('P21', 'sas_stata') and 4 other fieldsHigh correlation
('P23', 'sql') is highly correlated with ('P19', 'is_data_science_professional') and 30 other fieldsHigh correlation
('P23', 'nosql') is highly correlated with ('P19', 'is_data_science_professional') and 21 other fieldsHigh correlation
('P23', 'images') is highly correlated with ('P20', 'neural_networks') and 6 other fieldsHigh correlation
('P23', 'nlp') is highly correlated with ('P19', 'is_data_science_professional') and 18 other fieldsHigh correlation
('P23', 'videos') is highly correlated with ('P20', 'gans') and 1 other fieldsHigh correlation
('P23', 'sheets') is highly correlated with ('P19', 'is_data_science_professional') and 23 other fieldsHigh correlation
('P23', 'other') is highly correlated with ('P24', 'other')High correlation
('P24', 'sql') is highly correlated with ('P19', 'is_data_science_professional') and 22 other fieldsHigh correlation
('P24', 'nosql') is highly correlated with ('P23', 'nosql')High correlation
('P24', 'imagens') is highly correlated with ('P20', 'cnns') and 2 other fieldsHigh correlation
('P24', 'nlp') is highly correlated with ('P20', 'nlp') and 2 other fieldsHigh correlation
('P24', 'planilhas') is highly correlated with ('P19', 'is_data_science_professional') and 6 other fieldsHigh correlation
('P24', 'other') is highly correlated with ('P23', 'other') and 1 other fieldsHigh correlation
('P25', 'aws') is highly correlated with ('P19', 'is_data_science_professional') and 17 other fieldsHigh correlation
('P25', 'gcp') is highly correlated with ('P19', 'is_data_science_professional') and 5 other fieldsHigh correlation
('P25', 'azure') is highly correlated with ('P26', 'sql_server') and 2 other fieldsHigh correlation
('P25', 'on_premise_servers') is highly correlated with ('P35', 'other')High correlation
('P26', 'mysql') is highly correlated with ('P19', 'is_data_science_professional') and 7 other fieldsHigh correlation
('P26', 'oracle') is highly correlated with ('P19', 'is_data_science_professional') and 5 other fieldsHigh correlation
('P26', 'sql_server') is highly correlated with ('P19', 'is_data_science_professional') and 10 other fieldsHigh correlation
('P26', 'dynamodb') is highly correlated with ('P28', 'aws_glue') and 1 other fieldsHigh correlation
('P26', 'mongodb') is highly correlated with ('P19', 'is_data_science_professional') and 6 other fieldsHigh correlation
('P26', 's3') is highly correlated with ('P21', 'python') and 6 other fieldsHigh correlation
('P26', 'postgresql') is highly correlated with ('P19', 'is_data_science_professional') and 12 other fieldsHigh correlation
('P26', 'elaticsearch') is highly correlated with ('P23', 'nosql') and 1 other fieldsHigh correlation
('P26', 'other') is highly correlated with ('P35', 'other')High correlation
('P27', 'microsoft_powerbi') is highly correlated with ('P19', 'is_data_science_professional') and 13 other fieldsHigh correlation
('P27', 'tableau') is highly correlated with ('P19', 'is_data_science_professional') and 4 other fieldsHigh correlation
('P27', 'superset') is highly correlated with ('P35', 'other')High correlation
('P27', 'redash') is highly correlated with ('P35', 'other')High correlation
('P27', 'microstrategy') is highly correlated with ('P35', 'other')High correlation
('P27', 'sap_business_objects') is highly correlated with ('P28', 'sap_bw_etl') and 1 other fieldsHigh correlation
('P27', 'google_data_studio') is highly correlated with ('P25', 'gcp') and 2 other fieldsHigh correlation
('P27', 'only_excel_gsheets') is highly correlated with ('P35', 'other')High correlation
('P27', 'other') is highly correlated with ('P35', 'other')High correlation
('P28', 'sql_&_stored_procedures') is highly correlated with ('P19', 'is_data_science_professional') and 7 other fieldsHigh correlation
('P28', 'apache_airflow') is highly correlated with ('P25', 'aws') and 1 other fieldsHigh correlation
('P28', 'aws_glue') is highly correlated with ('P25', 'aws') and 3 other fieldsHigh correlation
('P28', 'pentaho') is highly correlated with ('P24', 'sql')High correlation
('P28', 'oracle_data_integrator') is highly correlated with ('P35', 'other')High correlation
('P28', 'sap_bw_etl') is highly correlated with ('P27', 'sap_business_objects') and 1 other fieldsHigh correlation
('P28', 'siss_sql_server_integration_services') is highly correlated with ('P26', 'sql_server') and 2 other fieldsHigh correlation
('P28', 'other') is highly correlated with ('P19', 'is_data_science_professional') and 1 other fieldsHigh correlation
('P29', 'have_data_warehouse') is highly correlated with ('P12', 'workers_number')High correlation
('P30', 'google_bigquery') is highly correlated with ('P25', 'gcp') and 2 other fieldsHigh correlation
('P30', 'aws_redshift') is highly correlated with ('P25', 'aws') and 2 other fieldsHigh correlation
('P30', 'oracle') is highly correlated with ('P26', 'oracle')High correlation
('P30', 'postgres_mysql') is highly correlated with ('P26', 'postgresql') and 1 other fieldsHigh correlation
('P30', 'microsoft_azure') is highly correlated with ('P25', 'azure')High correlation
('P31', 'data_hackers_blog') is highly correlated with ('P31', 'data_hackers_podcast') and 3 other fieldsHigh correlation
('P31', 'data_hackers_podcast') is highly correlated with ('P31', 'data_hackers_blog') and 3 other fieldsHigh correlation
('P31', 'weekly_newsletter') is highly correlated with ('P31', 'do_not_know_data_hackers') and 1 other fieldsHigh correlation
('P31', 'slack_channel') is highly correlated with ('P31', 'data_hackers_blog') and 3 other fieldsHigh correlation
('P31', 'data_hackers_bootcamp') is highly correlated with ('P35', 'other')High correlation
('P31', 'do_not_know_data_hackers') is highly correlated with ('P31', 'data_hackers_blog') and 4 other fieldsHigh correlation
('P32', 'prefered_data_hackers_initiative') is highly correlated with ('P31', 'data_hackers_blog') and 5 other fieldsHigh correlation
('P33', 'other') is highly correlated with ('P35', 'other')High correlation
('P34', 'udacity') is highly correlated with ('P35', 'data_science_plataforms_preference') and 1 other fieldsHigh correlation
('P34', 'coursera') is highly correlated with ('P35', 'data_science_plataforms_preference') and 1 other fieldsHigh correlation
('P34', 'udemy') is highly correlated with ('P34', 'online_courses') and 2 other fieldsHigh correlation
('P34', 'height') is highly correlated with ('P35', 'data_science_plataforms_preference') and 1 other fieldsHigh correlation
('P34', 'data_camp') is highly correlated with ('P35', 'data_science_plataforms_preference') and 1 other fieldsHigh correlation
('P34', 'data_quest') is highly correlated with ('P35', 'data_science_plataforms_preference') and 1 other fieldsHigh correlation
('P34', 'online_courses') is highly correlated with ('P34', 'udemy') and 3 other fieldsHigh correlation
('P34', 'other') is highly correlated with ('P35', 'other')High correlation
('P35', 'data_science_plataforms_preference') is highly correlated with ('P34', 'udacity') and 7 other fieldsHigh correlation
('P35', 'other') is highly correlated with ('P8', 'degreee_level') and 45 other fieldsHigh correlation
('D1', 'living_macroregion') is highly correlated with ('P5', 'living_state')High correlation
('D2', 'origin_macroregion') is highly correlated with ('P35', 'other')High correlation
('D3', 'anonymized_degree_area') is highly correlated with ('P35', 'other') and 3 other fieldsHigh correlation
('D4', 'anonymized_market_sector') is highly correlated with ('P35', 'other') and 1 other fieldsHigh correlation
('D5', 'anonymized_manager_level') is highly correlated with ('P12', 'workers_number') and 1 other fieldsHigh correlation
('D6', 'anonymized_role') is highly correlated with ('P19', 'is_data_science_professional') and 24 other fieldsHigh correlation
profissao is highly correlated with ('P19', 'is_data_science_professional') and 28 other fieldsHigh correlation
idade is highly correlated with ('P1', 'age') and 2 other fieldsHigh correlation
salario is highly correlated with ('P16', 'salary_range') and 1 other fieldsHigh correlation
tamanho_da_empresa is highly correlated with ('P10', 'job_situation') and 2 other fieldsHigh correlation
gestor is highly correlated with ('P13', 'manager')High correlation
se_considera_ds is highly correlated with ('P19', 'is_data_science_professional') and 31 other fieldsHigh correlation
sexo is highly correlated with ('P2', 'gender')High correlation
experiencia_ds is highly correlated with ('P17', 'time_experience_data_science')High correlation
tipo_de_trabalho is highly correlated with ('P10', 'job_situation') and 3 other fieldsHigh correlation
escolaridade is highly correlated with ('P1', 'age') and 2 other fieldsHigh correlation
area_de_formacao is highly correlated with ('P35', 'other') and 3 other fieldsHigh correlation
setor_de_mercado is highly correlated with ('P35', 'other') and 1 other fieldsHigh correlation
plataforma_favorita is highly correlated with ('P34', 'udacity') and 7 other fieldsHigh correlation
('P1', 'age') has 24 (1.4%) missing values Missing
('P5', 'living_state') has 337 (19.1%) missing values Missing
('P6', 'born_or_graduated') has 34 (1.9%) missing values Missing
('P12', 'workers_number') has 238 (13.5%) missing values Missing
('P13', 'manager') has 238 (13.5%) missing values Missing
('P16', 'salary_range') has 238 (13.5%) missing values Missing
('P22', 'most_used_proggraming_languages') has 859 (48.7%) missing values Missing
('P29', 'have_data_warehouse') has 972 (55.1%) missing values Missing
('P35', 'data_science_plataforms_preference') has 140 (7.9%) missing values Missing
('P35', 'other') has 1625 (92.1%) missing values Missing
('D1', 'living_macroregion') has 337 (19.1%) missing values Missing
('D2', 'origin_macroregion') has 1440 (81.6%) missing values Missing
('D3', 'anonymized_degree_area') has 35 (2.0%) missing values Missing
('D4', 'anonymized_market_sector') has 243 (13.8%) missing values Missing
('D5', 'anonymized_manager_level') has 1460 (82.7%) missing values Missing
('D6', 'anonymized_role') has 514 (29.1%) missing values Missing
profissao has 821 (46.5%) missing values Missing
idade has 24 (1.4%) missing values Missing
salario has 238 (13.5%) missing values Missing
tamanho_da_empresa has 366 (20.7%) missing values Missing
gestor has 238 (13.5%) missing values Missing
area_de_formacao has 35 (2.0%) missing values Missing
setor_de_mercado has 243 (13.8%) missing values Missing
plataforma_favorita has 140 (7.9%) missing values Missing
('P0', 'id') is uniformly distributed Uniform
('P0', 'id') has unique values Unique

Reproduction

Analysis started2022-06-30 19:00:29.967592
Analysis finished2022-06-30 19:02:52.911080
Duration2 minutes and 22.94 seconds
Software versionpandas-profiling v3.2.0
Download configurationconfig.json

Variables

('P0', 'id')
Categorical

HIGH CARDINALITY
UNIFORM
UNIQUE

Distinct1765
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
v9otv8j9wdvjrv9otvwnn9owhzq54ktv
 
1
lpbg52wujbplior6u1sfz6jz5y1sv9bi
 
1
0adf1nmon3qtu5px50adf1bcvbyyb019
 
1
rkoj6cjowueyh7aoixg4rkoj6cysaxxj
 
1
4hsf2rmxc33u5xknzghb04hsf2rm2dpn
 
1
Other values (1760)
1760 

Length

Max length32
Median length32
Mean length32
Min length32

Characters and Unicode

Total characters56480
Distinct characters36
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1765 ?
Unique (%)100.0%

Sample

1st rowv9otv8j9wdvjrv9otvwnn9owhzq54ktv
2nd row875ul998t0hqcv0871uptwf3oswcfv35
3rd rowpuscuk079vw1pusbb900pzw2xvpxtgdk
4th rowrmel8ewqpbffp2mnfbzermel8eqincov
5th rowpj9mgud4d6mdct1l7vq0pj9mgu78h6ju

Common Values

ValueCountFrequency (%)
v9otv8j9wdvjrv9otvwnn9owhzq54ktv1
 
0.1%
lpbg52wujbplior6u1sfz6jz5y1sv9bi1
 
0.1%
0adf1nmon3qtu5px50adf1bcvbyyb0191
 
0.1%
rkoj6cjowueyh7aoixg4rkoj6cysaxxj1
 
0.1%
4hsf2rmxc33u5xknzghb04hsf2rm2dpn1
 
0.1%
rr6fskzr7o0o8b5isbzjx9rr6fsr0qqa1
 
0.1%
6jecdg2anwple9poilukd6jecdg475u41
 
0.1%
0bk93xav7ytauebjcilauuoe0bk9365a1
 
0.1%
ay3tk9zie59nqnhay3tk9z5agwv27oln1
 
0.1%
ufbj66nszpl6ufbjg7zhqu6w1wq62t101
 
0.1%
Other values (1755)1755
99.4%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
v9otv8j9wdvjrv9otvwnn9owhzq54ktv1
 
0.1%
e3ee6zjc4cd4e3efrp52vkmx4grcs6pu1
 
0.1%
rmel8ewqpbffp2mnfbzermel8eqincov1
 
0.1%
pj9mgud4d6mdct1l7vq0pj9mgu78h6ju1
 
0.1%
cb7n2v7372y97wl1lcb7n2e8tpockl831
 
0.1%
ayev5viaqe43pxxqqxayev5vtvdba6191
 
0.1%
se3czgy682ew760hzvhmvsse3czpwozo1
 
0.1%
h5zvhct1kbmbb49h5zgzb3c1ce6lnrl61
 
0.1%
4t388yqrekd1gsq4t388b9gqkmt2z86x1
 
0.1%
yow87k0qg2lcld30dfxeuyow87k01w1w1
 
0.1%
Other values (1755)1755
99.4%

Most occurring characters

ValueCountFrequency (%)
e1650
 
2.9%
o1631
 
2.9%
z1629
 
2.9%
21627
 
2.9%
01623
 
2.9%
91613
 
2.9%
f1611
 
2.9%
j1607
 
2.8%
q1594
 
2.8%
u1592
 
2.8%
Other values (26)40303
71.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter40759
72.2%
Decimal Number15721
 
27.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e1650
 
4.0%
o1631
 
4.0%
z1629
 
4.0%
f1611
 
4.0%
j1607
 
3.9%
q1594
 
3.9%
u1592
 
3.9%
d1587
 
3.9%
a1586
 
3.9%
v1582
 
3.9%
Other values (16)24690
60.6%
Decimal Number
ValueCountFrequency (%)
21627
10.3%
01623
10.3%
91613
10.3%
31586
10.1%
51564
9.9%
61555
9.9%
81551
9.9%
41550
9.9%
11526
9.7%
71526
9.7%

Most occurring scripts

ValueCountFrequency (%)
Latin40759
72.2%
Common15721
 
27.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e1650
 
4.0%
o1631
 
4.0%
z1629
 
4.0%
f1611
 
4.0%
j1607
 
3.9%
q1594
 
3.9%
u1592
 
3.9%
d1587
 
3.9%
a1586
 
3.9%
v1582
 
3.9%
Other values (16)24690
60.6%
Common
ValueCountFrequency (%)
21627
10.3%
01623
10.3%
91613
10.3%
31586
10.1%
51564
9.9%
61555
9.9%
81551
9.9%
41550
9.9%
11526
9.7%
71526
9.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII56480
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e1650
 
2.9%
o1631
 
2.9%
z1629
 
2.9%
21627
 
2.9%
01623
 
2.9%
91613
 
2.9%
f1611
 
2.9%
j1607
 
2.8%
q1594
 
2.8%
u1592
 
2.8%
Other values (26)40303
71.4%

('P1', 'age')
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
MISSING

Distinct33
Distinct (%)1.9%
Missing24
Missing (%)1.4%
Infinite0
Infinite (%)0.0%
Mean29.80068926
Minimum18
Maximum50
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.9 KiB

Quantile statistics

Minimum18
5-th percentile21
Q125
median29
Q334
95-th percentile43
Maximum50
Range32
Interquartile range (IQR)9

Descriptive statistics

Standard deviation6.595794514
Coefficient of variation (CV)0.2213302671
Kurtosis0.1479533151
Mean29.80068926
Median Absolute Deviation (MAD)4
Skewness0.7315076121
Sum51883
Variance43.50450527
MonotonicityNot monotonic
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
27119
 
6.7%
25118
 
6.7%
29108
 
6.1%
30107
 
6.1%
26101
 
5.7%
2897
 
5.5%
2397
 
5.5%
2294
 
5.3%
2489
 
5.0%
3188
 
5.0%
Other values (23)723
41.0%
ValueCountFrequency (%)
188
 
0.5%
1914
 
0.8%
2039
 
2.2%
2158
3.3%
2294
5.3%
2397
5.5%
2489
5.0%
25118
6.7%
26101
5.7%
27119
6.7%
ValueCountFrequency (%)
5010
 
0.6%
496
 
0.3%
484
 
0.2%
4715
0.8%
469
 
0.5%
4518
1.0%
4412
0.7%
4317
1.0%
4216
0.9%
4126
1.5%

('P2', 'gender')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing3
Missing (%)0.2%
Memory size13.9 KiB
Masculino
1436 
Feminino
326 

Length

Max length9
Median length9
Mean length8.814982974
Min length8

Characters and Unicode

Total characters15532
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMasculino
2nd rowFeminino
3rd rowMasculino
4th rowMasculino
5th rowMasculino

Common Values

ValueCountFrequency (%)
Masculino1436
81.4%
Feminino326
 
18.5%
(Missing)3
 
0.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
masculino1436
81.5%
feminino326
 
18.5%

Most occurring characters

ValueCountFrequency (%)
i2088
13.4%
n2088
13.4%
o1762
11.3%
M1436
9.2%
a1436
9.2%
s1436
9.2%
c1436
9.2%
u1436
9.2%
l1436
9.2%
F326
 
2.1%
Other values (2)652
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter13770
88.7%
Uppercase Letter1762
 
11.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i2088
15.2%
n2088
15.2%
o1762
12.8%
a1436
10.4%
s1436
10.4%
c1436
10.4%
u1436
10.4%
l1436
10.4%
e326
 
2.4%
m326
 
2.4%
Uppercase Letter
ValueCountFrequency (%)
M1436
81.5%
F326
 
18.5%

Most occurring scripts

ValueCountFrequency (%)
Latin15532
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
i2088
13.4%
n2088
13.4%
o1762
11.3%
M1436
9.2%
a1436
9.2%
s1436
9.2%
c1436
9.2%
u1436
9.2%
l1436
9.2%
F326
 
2.1%
Other values (2)652
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII15532
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i2088
13.4%
n2088
13.4%
o1762
11.3%
M1436
9.2%
a1436
9.2%
s1436
9.2%
c1436
9.2%
u1436
9.2%
l1436
9.2%
F326
 
2.1%
Other values (2)652
 
4.2%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
1
1731 
0
 
34

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
11731
98.1%
034
 
1.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
11731
98.1%
034
 
1.9%

Most occurring characters

ValueCountFrequency (%)
11731
98.1%
034
 
1.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
11731
98.1%
034
 
1.9%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
11731
98.1%
034
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11731
98.1%
034
 
1.9%

('P5', 'living_state')
Categorical

HIGH CORRELATION
MISSING

Distinct7
Distinct (%)0.5%
Missing337
Missing (%)19.1%
Memory size13.9 KiB
São Paulo (SP)
669 
Minas Gerais (MG)
316 
Rio de Janeiro (RJ)
147 
Paraná (PR)
117 
Santa Catarina (SC)
83 
Other values (2)
96 

Length

Max length22
Median length19
Mean length15.70448179
Min length11

Characters and Unicode

Total characters22426
Distinct characters26
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMinas Gerais (MG)
2nd rowSão Paulo (SP)
3rd rowSão Paulo (SP)
4th rowSão Paulo (SP)
5th rowParaná (PR)

Common Values

ValueCountFrequency (%)
São Paulo (SP)669
37.9%
Minas Gerais (MG)316
17.9%
Rio de Janeiro (RJ)147
 
8.3%
Paraná (PR)117
 
6.6%
Santa Catarina (SC)83
 
4.7%
Rio Grande do Sul (RS)69
 
3.9%
Espírito Santo (ES)27
 
1.5%
(Missing)337
19.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
são669
15.0%
paulo669
15.0%
sp669
15.0%
minas316
 
7.1%
gerais316
 
7.1%
mg316
 
7.1%
rio216
 
4.9%
de147
 
3.3%
janeiro147
 
3.3%
rj147
 
3.3%
Other values (12)840
18.9%

Most occurring characters

ValueCountFrequency (%)
3024
13.5%
a2193
 
9.8%
o1824
 
8.1%
S1696
 
7.6%
P1572
 
7.0%
(1428
 
6.4%
)1428
 
6.4%
i1105
 
4.9%
n842
 
3.8%
r759
 
3.4%
Other values (16)6555
29.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter10882
48.5%
Uppercase Letter5664
25.3%
Space Separator3024
 
13.5%
Open Punctuation1428
 
6.4%
Close Punctuation1428
 
6.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a2193
20.2%
o1824
16.8%
i1105
10.2%
n842
 
7.7%
r759
 
7.0%
u738
 
6.8%
l738
 
6.8%
e679
 
6.2%
ã669
 
6.1%
s659
 
6.1%
Other values (5)676
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
S1696
29.9%
P1572
27.8%
G701
12.4%
M632
 
11.2%
R549
 
9.7%
J294
 
5.2%
C166
 
2.9%
E54
 
1.0%
Space Separator
ValueCountFrequency (%)
3024
100.0%
Open Punctuation
ValueCountFrequency (%)
(1428
100.0%
Close Punctuation
ValueCountFrequency (%)
)1428
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin16546
73.8%
Common5880
 
26.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
a2193
13.3%
o1824
11.0%
S1696
 
10.3%
P1572
 
9.5%
i1105
 
6.7%
n842
 
5.1%
r759
 
4.6%
u738
 
4.5%
l738
 
4.5%
G701
 
4.2%
Other values (13)4378
26.5%
Common
ValueCountFrequency (%)
3024
51.4%
(1428
24.3%
)1428
24.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII21613
96.4%
None813
 
3.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3024
14.0%
a2193
 
10.1%
o1824
 
8.4%
S1696
 
7.8%
P1572
 
7.3%
(1428
 
6.6%
)1428
 
6.6%
i1105
 
5.1%
n842
 
3.9%
r759
 
3.5%
Other values (13)5742
26.6%
None
ValueCountFrequency (%)
ã669
82.3%
á117
 
14.4%
í27
 
3.3%

('P6', 'born_or_graduated')
Categorical

MISSING

Distinct2
Distinct (%)0.1%
Missing34
Missing (%)1.9%
Memory size13.9 KiB
1.0
1403 
0.0
328 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters5193
Distinct characters3
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1.0
2nd row1.0
3rd row1.0
4th row0.0
5th row1.0

Common Values

ValueCountFrequency (%)
1.01403
79.5%
0.0328
 
18.6%
(Missing)34
 
1.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
1.01403
81.1%
0.0328
 
18.9%

Most occurring characters

ValueCountFrequency (%)
02059
39.6%
.1731
33.3%
11403
27.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3462
66.7%
Other Punctuation1731
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
02059
59.5%
11403
40.5%
Other Punctuation
ValueCountFrequency (%)
.1731
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common5193
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
02059
39.6%
.1731
33.3%
11403
27.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII5193
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
02059
39.6%
.1731
33.3%
11403
27.0%

('P8', 'degreee_level')
Categorical

HIGH CORRELATION

Distinct7
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
Graduação/Bacharelado
578 
Pós-graduação
527 
Estudante de Graduação
374 
Mestrado
201 
Doutorado ou Phd
 
50
Other values (2)
 
35

Length

Max length26
Median length22
Mean length17.29688385
Min length8

Characters and Unicode

Total characters30529
Distinct characters29
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st rowEstudante de Graduação
2nd rowEstudante de Graduação
3rd rowGraduação/Bacharelado
4th rowEstudante de Graduação
5th rowGraduação/Bacharelado

Common Values

ValueCountFrequency (%)
Graduação/Bacharelado578
32.7%
Pós-graduação527
29.9%
Estudante de Graduação374
21.2%
Mestrado201
 
11.4%
Doutorado ou Phd50
 
2.8%
Não tenho graduação formal34
 
1.9%
Prefiro não informar1
 
0.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
graduação/bacharelado578
21.3%
pós-graduação527
19.4%
graduação408
15.0%
estudante374
13.8%
de374
13.8%
mestrado201
 
7.4%
doutorado50
 
1.8%
ou50
 
1.8%
phd50
 
1.8%
não35
 
1.3%
Other values (4)70
 
2.6%

Most occurring characters

ValueCountFrequency (%)
a5420
17.8%
d3140
 
10.3%
o2597
 
8.5%
r2380
 
7.8%
u1987
 
6.5%
e1562
 
5.1%
ã1548
 
5.1%
ç1513
 
5.0%
s1102
 
3.6%
t1033
 
3.4%
Other values (19)8247
27.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter25705
84.2%
Uppercase Letter2767
 
9.1%
Space Separator952
 
3.1%
Other Punctuation578
 
1.9%
Dash Punctuation527
 
1.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a5420
21.1%
d3140
12.2%
o2597
10.1%
r2380
9.3%
u1987
 
7.7%
e1562
 
6.1%
ã1548
 
6.0%
ç1513
 
5.9%
s1102
 
4.3%
t1033
 
4.0%
Other values (9)3423
13.3%
Uppercase Letter
ValueCountFrequency (%)
G952
34.4%
P578
20.9%
B578
20.9%
E374
 
13.5%
M201
 
7.3%
D50
 
1.8%
N34
 
1.2%
Space Separator
ValueCountFrequency (%)
952
100.0%
Other Punctuation
ValueCountFrequency (%)
/578
100.0%
Dash Punctuation
ValueCountFrequency (%)
-527
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin28472
93.3%
Common2057
 
6.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
a5420
19.0%
d3140
11.0%
o2597
9.1%
r2380
 
8.4%
u1987
 
7.0%
e1562
 
5.5%
ã1548
 
5.4%
ç1513
 
5.3%
s1102
 
3.9%
t1033
 
3.6%
Other values (16)6190
21.7%
Common
ValueCountFrequency (%)
952
46.3%
/578
28.1%
-527
25.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII26941
88.2%
None3588
 
11.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a5420
20.1%
d3140
11.7%
o2597
9.6%
r2380
8.8%
u1987
 
7.4%
e1562
 
5.8%
s1102
 
4.1%
t1033
 
3.8%
G952
 
3.5%
952
 
3.5%
Other values (16)5816
21.6%
None
ValueCountFrequency (%)
ã1548
43.1%
ç1513
42.2%
ó527
 
14.7%

('P10', 'job_situation')
Categorical

HIGH CORRELATION

Distinct11
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
Empregado (CTL)
1073 
Empreendedor ou Empregado (CNPJ)
234 
Estagiário
131 
Somente Estudante (graduação)
 
85
Desempregado, buscando recolocação
 
69
Other values (6)
173 

Length

Max length45
Median length15
Mean length19.27988669
Min length10

Characters and Unicode

Total characters34029
Distinct characters44
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEmpregado (CTL)
2nd rowEmpregado (CTL)
3rd rowEmpregado (CTL)
4th rowEstagiário
5th rowFreelancer

Common Values

ValueCountFrequency (%)
Empregado (CTL)1073
60.8%
Empreendedor ou Empregado (CNPJ)234
 
13.3%
Estagiário131
 
7.4%
Somente Estudante (graduação)85
 
4.8%
Desempregado, buscando recolocação69
 
3.9%
Servidor Público60
 
3.4%
Trabalho na área Acadêmica/Pesquisador45
 
2.5%
Somente Estudante (pós-graduação)36
 
2.0%
Freelancer23
 
1.3%
Prefiro não dizer6
 
0.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
empregado1307
31.6%
ctl1073
25.9%
empreendedor234
 
5.6%
ou234
 
5.6%
cnpj234
 
5.6%
estagiário131
 
3.2%
somente121
 
2.9%
estudante121
 
2.9%
graduação85
 
2.1%
desempregado72
 
1.7%
Other values (15)530
12.8%

Most occurring characters

ValueCountFrequency (%)
e2897
 
8.5%
o2736
 
8.0%
r2490
 
7.3%
2377
 
7.0%
a2355
 
6.9%
d2317
 
6.8%
E1793
 
5.3%
m1779
 
5.2%
p1649
 
4.8%
g1631
 
4.8%
Other values (34)12005
35.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter22221
65.3%
Uppercase Letter6425
 
18.9%
Space Separator2377
 
7.0%
Open Punctuation1428
 
4.2%
Close Punctuation1428
 
4.2%
Other Punctuation114
 
0.3%
Dash Punctuation36
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e2897
13.0%
o2736
12.3%
r2490
11.2%
a2355
10.6%
d2317
10.4%
m1779
8.0%
p1649
7.4%
g1631
7.3%
n625
 
2.8%
u596
 
2.7%
Other values (17)3146
14.2%
Uppercase Letter
ValueCountFrequency (%)
E1793
27.9%
C1307
20.3%
T1118
17.4%
L1073
16.7%
P345
 
5.4%
J234
 
3.6%
N234
 
3.6%
S181
 
2.8%
D72
 
1.1%
A45
 
0.7%
Other Punctuation
ValueCountFrequency (%)
,69
60.5%
/45
39.5%
Space Separator
ValueCountFrequency (%)
2377
100.0%
Open Punctuation
ValueCountFrequency (%)
(1428
100.0%
Close Punctuation
ValueCountFrequency (%)
)1428
100.0%
Dash Punctuation
ValueCountFrequency (%)
-36
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin28646
84.2%
Common5383
 
15.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e2897
 
10.1%
o2736
 
9.6%
r2490
 
8.7%
a2355
 
8.2%
d2317
 
8.1%
E1793
 
6.3%
m1779
 
6.2%
p1649
 
5.8%
g1631
 
5.7%
C1307
 
4.6%
Other values (28)7692
26.9%
Common
ValueCountFrequency (%)
2377
44.2%
(1428
26.5%
)1428
26.5%
,69
 
1.3%
/45
 
0.8%
-36
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII33317
97.9%
None712
 
2.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e2897
 
8.7%
o2736
 
8.2%
r2490
 
7.5%
2377
 
7.1%
a2355
 
7.1%
d2317
 
7.0%
E1793
 
5.4%
m1779
 
5.3%
p1649
 
4.9%
g1631
 
4.9%
Other values (28)11293
33.9%
None
ValueCountFrequency (%)
ã202
28.4%
ç193
27.1%
á176
24.7%
ú60
 
8.4%
ê45
 
6.3%
ó36
 
5.1%

('P12', 'workers_number')
Categorical

HIGH CORRELATION
MISSING

Distinct8
Distinct (%)0.5%
Missing238
Missing (%)13.5%
Memory size13.9 KiB
Acima de 3000
393 
de 101 a 500
333 
de 11 a 50
204 
de 501 a 1000
172 
de 1001 a 3000
164 
Other values (3)
261 

Length

Max length14
Median length13
Mean length11.92534381
Min length8

Characters and Unicode

Total characters18210
Distinct characters13
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowde 1 a 5
2nd rowAcima de 3000
3rd rowAcima de 3000
4th rowde 11 a 50
5th rowde 6 a 10

Common Values

ValueCountFrequency (%)
Acima de 3000393
22.3%
de 101 a 500333
18.9%
de 11 a 50204
11.6%
de 501 a 1000172
9.7%
de 1001 a 3000164
9.3%
de 51 a 100128
 
7.3%
de 1 a 572
 
4.1%
de 6 a 1061
 
3.5%
(Missing)238
13.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
de1527
26.7%
a1134
19.8%
3000557
 
9.7%
acima393
 
6.9%
101333
 
5.8%
500333
 
5.8%
11204
 
3.6%
50204
 
3.6%
1000172
 
3.0%
501172
 
3.0%
Other values (7)686
12.0%

Most occurring characters

ValueCountFrequency (%)
04207
23.1%
4188
23.0%
12135
11.7%
a1527
 
8.4%
d1527
 
8.4%
e1527
 
8.4%
5909
 
5.0%
3557
 
3.1%
A393
 
2.2%
c393
 
2.2%
Other values (3)847
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number7869
43.2%
Lowercase Letter5760
31.6%
Space Separator4188
23.0%
Uppercase Letter393
 
2.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a1527
26.5%
d1527
26.5%
e1527
26.5%
c393
 
6.8%
i393
 
6.8%
m393
 
6.8%
Decimal Number
ValueCountFrequency (%)
04207
53.5%
12135
27.1%
5909
 
11.6%
3557
 
7.1%
661
 
0.8%
Space Separator
ValueCountFrequency (%)
4188
100.0%
Uppercase Letter
ValueCountFrequency (%)
A393
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common12057
66.2%
Latin6153
33.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
a1527
24.8%
d1527
24.8%
e1527
24.8%
A393
 
6.4%
c393
 
6.4%
i393
 
6.4%
m393
 
6.4%
Common
ValueCountFrequency (%)
04207
34.9%
4188
34.7%
12135
17.7%
5909
 
7.5%
3557
 
4.6%
661
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII18210
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
04207
23.1%
4188
23.0%
12135
11.7%
a1527
 
8.4%
d1527
 
8.4%
e1527
 
8.4%
5909
 
5.0%
3557
 
3.1%
A393
 
2.2%
c393
 
2.2%
Other values (3)847
 
4.7%

('P13', 'manager')
Categorical

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)0.1%
Missing238
Missing (%)13.5%
Memory size13.9 KiB
0.0
1222 
1.0
305 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters4581
Distinct characters3
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.0
2nd row0.0
3rd row0.0
4th row0.0
5th row1.0

Common Values

ValueCountFrequency (%)
0.01222
69.2%
1.0305
 
17.3%
(Missing)238
 
13.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
0.01222
80.0%
1.0305
 
20.0%

Most occurring characters

ValueCountFrequency (%)
02749
60.0%
.1527
33.3%
1305
 
6.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3054
66.7%
Other Punctuation1527
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
02749
90.0%
1305
 
10.0%
Other Punctuation
ValueCountFrequency (%)
.1527
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common4581
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
02749
60.0%
.1527
33.3%
1305
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII4581
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
02749
60.0%
.1527
33.3%
1305
 
6.7%

('P16', 'salary_range')
Categorical

HIGH CORRELATION
MISSING

Distinct11
Distinct (%)0.7%
Missing238
Missing (%)13.5%
Memory size13.9 KiB
de R$ 4.001/mês a R$ 6.000/mês
308 
de R$ 8.001/mês a R$ 12.000/mês
237 
de R$ 6.001/mês a R$ 8.000/mês
229 
de R$ 3.001/mês a R$ 4.000/mês
219 
de R$ 1.001/mês a R$ 2.000/mês
181 
Other values (6)
353 

Length

Max length32
Median length30
Mean length29.89194499
Min length21

Characters and Unicode

Total characters45645
Distinct characters25
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowde R$ 1.001/mês a R$ 2.000/mês
2nd rowde R$ 2.001/mês a R$ 3000/mês
3rd rowde R$ 4.001/mês a R$ 6.000/mês
4th rowde R$ 1.001/mês a R$ 2.000/mês
5th rowde R$ 6.001/mês a R$ 8.000/mês

Common Values

ValueCountFrequency (%)
de R$ 4.001/mês a R$ 6.000/mês308
17.5%
de R$ 8.001/mês a R$ 12.000/mês237
13.4%
de R$ 6.001/mês a R$ 8.000/mês229
13.0%
de R$ 3.001/mês a R$ 4.000/mês219
12.4%
de R$ 1.001/mês a R$ 2.000/mês181
10.3%
de R$ 2.001/mês a R$ 3000/mês150
8.5%
de R$ 12.001/mês a R$ 16.000/mês82
 
4.6%
Menos de R$ 1.000/mês48
 
2.7%
de R$ 16.001/mês a R$ 20.000/mês45
 
2.5%
de R$ 20.001/mês a R$ 25.000/mês15
 
0.8%
(Missing)238
13.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
r2993
33.1%
de1527
16.9%
a1466
16.2%
4.001/mês308
 
3.4%
6.000/mês308
 
3.4%
8.001/mês237
 
2.6%
12.000/mês237
 
2.6%
6.001/mês229
 
2.5%
8.000/mês229
 
2.5%
3.001/mês219
 
2.4%
Other values (15)1287
14.2%

Most occurring characters

ValueCountFrequency (%)
07560
16.6%
7513
16.5%
s3041
6.7%
m3006
 
6.6%
ê2993
 
6.6%
R2993
 
6.6%
$2993
 
6.6%
/2993
 
6.6%
.2843
 
6.2%
12154
 
4.7%
Other values (15)7556
16.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter13743
30.1%
Decimal Number12506
27.4%
Space Separator7513
16.5%
Other Punctuation5836
12.8%
Uppercase Letter3054
 
6.7%
Currency Symbol2993
 
6.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s3041
22.1%
m3006
21.9%
ê2993
21.8%
e1575
11.5%
d1527
11.1%
a1479
10.8%
n48
 
0.3%
o48
 
0.3%
c13
 
0.1%
i13
 
0.1%
Decimal Number
ValueCountFrequency (%)
07560
60.5%
12154
 
17.2%
2738
 
5.9%
6664
 
5.3%
4527
 
4.2%
8466
 
3.7%
3369
 
3.0%
528
 
0.2%
Uppercase Letter
ValueCountFrequency (%)
R2993
98.0%
M48
 
1.6%
A13
 
0.4%
Other Punctuation
ValueCountFrequency (%)
/2993
51.3%
.2843
48.7%
Space Separator
ValueCountFrequency (%)
7513
100.0%
Currency Symbol
ValueCountFrequency (%)
$2993
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common28848
63.2%
Latin16797
36.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
s3041
18.1%
m3006
17.9%
ê2993
17.8%
R2993
17.8%
e1575
9.4%
d1527
9.1%
a1479
8.8%
M48
 
0.3%
n48
 
0.3%
o48
 
0.3%
Other values (3)39
 
0.2%
Common
ValueCountFrequency (%)
07560
26.2%
7513
26.0%
$2993
 
10.4%
/2993
 
10.4%
.2843
 
9.9%
12154
 
7.5%
2738
 
2.6%
6664
 
2.3%
4527
 
1.8%
8466
 
1.6%
Other values (2)397
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII42652
93.4%
None2993
 
6.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
07560
17.7%
7513
17.6%
s3041
7.1%
m3006
 
7.0%
R2993
 
7.0%
$2993
 
7.0%
/2993
 
7.0%
.2843
 
6.7%
12154
 
5.1%
e1575
 
3.7%
Other values (14)5981
14.0%
None
ValueCountFrequency (%)
ê2993
100.0%

('P17', 'time_experience_data_science')
Categorical

HIGH CORRELATION

Distinct7
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
Menos de 1 ano
445 
de 1 a 2 anos
343 
de 2 a 3 anos
244 
Não tenho experiência na área de dados
221 
de 4 a 5 anos
186 
Other values (2)
326 

Length

Max length38
Median length15
Mean length16.65042493
Min length13

Characters and Unicode

Total characters29388
Distinct characters26
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNão tenho experiência na área de dados
2nd rowMenos de 1 ano
3rd rowde 1 a 2 anos
4th rowMenos de 1 ano
5th rowde 4 a 5 anos

Common Values

ValueCountFrequency (%)
Menos de 1 ano445
25.2%
de 1 a 2 anos343
19.4%
de 2 a 3 anos244
13.8%
Não tenho experiência na área de dados221
12.5%
de 4 a 5 anos186
10.5%
de 6 a 10 anos179
10.1%
Mais de 10 anos147
 
8.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
de1765
20.3%
anos1099
12.7%
a952
11.0%
1788
9.1%
2587
 
6.8%
menos445
 
5.1%
ano445
 
5.1%
10326
 
3.8%
3244
 
2.8%
área221
 
2.5%
Other values (9)1803
20.8%

Most occurring characters

ValueCountFrequency (%)
6910
23.5%
a3527
12.0%
e3094
10.5%
n2652
 
9.0%
o2652
 
9.0%
d2207
 
7.5%
s1912
 
6.5%
11114
 
3.8%
M592
 
2.0%
i589
 
2.0%
Other values (16)4139
14.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter18843
64.1%
Space Separator6910
 
23.5%
Decimal Number2822
 
9.6%
Uppercase Letter813
 
2.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a3527
18.7%
e3094
16.4%
n2652
14.1%
o2652
14.1%
d2207
11.7%
s1912
10.1%
i589
 
3.1%
r442
 
2.3%
ã221
 
1.2%
h221
 
1.2%
Other values (6)1326
 
7.0%
Decimal Number
ValueCountFrequency (%)
11114
39.5%
2587
20.8%
0326
 
11.6%
3244
 
8.6%
4186
 
6.6%
5186
 
6.6%
6179
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
M592
72.8%
N221
 
27.2%
Space Separator
ValueCountFrequency (%)
6910
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin19656
66.9%
Common9732
33.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a3527
17.9%
e3094
15.7%
n2652
13.5%
o2652
13.5%
d2207
11.2%
s1912
9.7%
M592
 
3.0%
i589
 
3.0%
r442
 
2.2%
ã221
 
1.1%
Other values (8)1768
9.0%
Common
ValueCountFrequency (%)
6910
71.0%
11114
 
11.4%
2587
 
6.0%
0326
 
3.3%
3244
 
2.5%
4186
 
1.9%
5186
 
1.9%
6179
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII28725
97.7%
None663
 
2.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6910
24.1%
a3527
12.3%
e3094
10.8%
n2652
 
9.2%
o2652
 
9.2%
d2207
 
7.7%
s1912
 
6.7%
11114
 
3.9%
M592
 
2.1%
i589
 
2.1%
Other values (13)3476
12.1%
None
ValueCountFrequency (%)
ã221
33.3%
ê221
33.3%
á221
33.3%

('P18', 'time_experience_before')
Categorical

HIGH CORRELATION

Distinct7
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
Não tive experiência na área de TI/Engenharia de Software antes de começar a trabalhar na área de dados
555 
Menos de 1 ano
333 
de 1 a 2 anos
210 
de 6 a 10 anos
188 
Mais de 10 anos
181 
Other values (2)
298 

Length

Max length103
Median length15
Mean length41.80056657
Min length13

Characters and Unicode

Total characters73778
Distinct characters39
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNão tive experiência na área de TI/Engenharia de Software antes de começar a trabalhar na área de dados
2nd rowNão tive experiência na área de TI/Engenharia de Software antes de começar a trabalhar na área de dados
3rd rowde 6 a 10 anos
4th rowde 2 a 3 anos
5th rowde 4 a 5 anos

Common Values

ValueCountFrequency (%)
Não tive experiência na área de TI/Engenharia de Software antes de começar a trabalhar na área de dados555
31.4%
Menos de 1 ano333
18.9%
de 1 a 2 anos210
 
11.9%
de 6 a 10 anos188
 
10.7%
Mais de 10 anos181
 
10.3%
de 4 a 5 anos161
 
9.1%
de 2 a 3 anos137
 
7.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
de3430
22.1%
a1251
 
8.1%
na1110
 
7.1%
área1110
 
7.1%
anos877
 
5.6%
não555
 
3.6%
começar555
 
3.6%
tive555
 
3.6%
trabalhar555
 
3.6%
dados555
 
3.6%
Other values (14)4973
32.0%

Most occurring characters

ValueCountFrequency (%)
13761
18.7%
a10412
14.1%
e8758
11.9%
n4873
 
6.6%
d4540
 
6.2%
r4440
 
6.0%
o3763
 
5.1%
s2501
 
3.4%
i2401
 
3.3%
t2220
 
3.0%
Other values (29)16109
21.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter53898
73.1%
Space Separator13761
 
18.7%
Uppercase Letter3289
 
4.5%
Decimal Number2275
 
3.1%
Other Punctuation555
 
0.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a10412
19.3%
e8758
16.2%
n4873
9.0%
d4540
8.4%
r4440
8.2%
o3763
 
7.0%
s2501
 
4.6%
i2401
 
4.5%
t2220
 
4.1%
á1110
 
2.1%
Other values (14)8880
16.5%
Decimal Number
ValueCountFrequency (%)
1912
40.1%
0369
16.2%
2347
 
15.3%
6188
 
8.3%
4161
 
7.1%
5161
 
7.1%
3137
 
6.0%
Uppercase Letter
ValueCountFrequency (%)
N555
16.9%
S555
16.9%
E555
16.9%
I555
16.9%
T555
16.9%
M514
15.6%
Space Separator
ValueCountFrequency (%)
13761
100.0%
Other Punctuation
ValueCountFrequency (%)
/555
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin57187
77.5%
Common16591
 
22.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
a10412
18.2%
e8758
15.3%
n4873
8.5%
d4540
 
7.9%
r4440
 
7.8%
o3763
 
6.6%
s2501
 
4.4%
i2401
 
4.2%
t2220
 
3.9%
á1110
 
1.9%
Other values (20)12169
21.3%
Common
ValueCountFrequency (%)
13761
82.9%
1912
 
5.5%
/555
 
3.3%
0369
 
2.2%
2347
 
2.1%
6188
 
1.1%
4161
 
1.0%
5161
 
1.0%
3137
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII71003
96.2%
None2775
 
3.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13761
19.4%
a10412
14.7%
e8758
12.3%
n4873
 
6.9%
d4540
 
6.4%
r4440
 
6.3%
o3763
 
5.3%
s2501
 
3.5%
i2401
 
3.4%
t2220
 
3.1%
Other values (25)13334
18.8%
None
ValueCountFrequency (%)
á1110
40.0%
ç555
20.0%
ã555
20.0%
ê555
20.0%

('P19', 'is_data_science_professional')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
1
915 
0
850 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1915
51.8%
0850
48.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
1915
51.8%
0850
48.2%

Most occurring characters

ValueCountFrequency (%)
1915
51.8%
0850
48.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1915
51.8%
0850
48.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1915
51.8%
0850
48.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1915
51.8%
0850
48.2%

('P20', 'linear_regression')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1241 
1
524 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01241
70.3%
1524
29.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01241
70.3%
1524
29.7%

Most occurring characters

ValueCountFrequency (%)
01241
70.3%
1524
29.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01241
70.3%
1524
29.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01241
70.3%
1524
29.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01241
70.3%
1524
29.7%

('P20', 'logistic_regression')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1359 
1
406 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01359
77.0%
1406
 
23.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01359
77.0%
1406
 
23.0%

Most occurring characters

ValueCountFrequency (%)
01359
77.0%
1406
 
23.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01359
77.0%
1406
 
23.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01359
77.0%
1406
 
23.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01359
77.0%
1406
 
23.0%

('P20', 'glms')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1631 
1
 
134

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01631
92.4%
1134
 
7.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01631
92.4%
1134
 
7.6%

Most occurring characters

ValueCountFrequency (%)
01631
92.4%
1134
 
7.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01631
92.4%
1134
 
7.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01631
92.4%
1134
 
7.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01631
92.4%
1134
 
7.6%

('P20', 'decision_tree')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1324 
1
441 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01324
75.0%
1441
 
25.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01324
75.0%
1441
 
25.0%

Most occurring characters

ValueCountFrequency (%)
01324
75.0%
1441
 
25.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01324
75.0%
1441
 
25.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01324
75.0%
1441
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01324
75.0%
1441
 
25.0%

('P20', 'random_forest')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1406 
1
359 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
01406
79.7%
1359
 
20.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01406
79.7%
1359
 
20.3%

Most occurring characters

ValueCountFrequency (%)
01406
79.7%
1359
 
20.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01406
79.7%
1359
 
20.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01406
79.7%
1359
 
20.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01406
79.7%
1359
 
20.3%

('P20', 'neural_networks')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1498 
1
267 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01498
84.9%
1267
 
15.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01498
84.9%
1267
 
15.1%

Most occurring characters

ValueCountFrequency (%)
01498
84.9%
1267
 
15.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01498
84.9%
1267
 
15.1%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01498
84.9%
1267
 
15.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01498
84.9%
1267
 
15.1%

('P20', 'bayesian_inference')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1541 
1
224 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01541
87.3%
1224
 
12.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01541
87.3%
1224
 
12.7%

Most occurring characters

ValueCountFrequency (%)
01541
87.3%
1224
 
12.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01541
87.3%
1224
 
12.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01541
87.3%
1224
 
12.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01541
87.3%
1224
 
12.7%

('P20', 'ensemble')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1596 
1
169 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01596
90.4%
1169
 
9.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01596
90.4%
1169
 
9.6%

Most occurring characters

ValueCountFrequency (%)
01596
90.4%
1169
 
9.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01596
90.4%
1169
 
9.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01596
90.4%
1169
 
9.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01596
90.4%
1169
 
9.6%

('P20', 'svms')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1585 
1
180 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01585
89.8%
1180
 
10.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01585
89.8%
1180
 
10.2%

Most occurring characters

ValueCountFrequency (%)
01585
89.8%
1180
 
10.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01585
89.8%
1180
 
10.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01585
89.8%
1180
 
10.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01585
89.8%
1180
 
10.2%

('P20', 'cnns')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1664 
1
 
101

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01664
94.3%
1101
 
5.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01664
94.3%
1101
 
5.7%

Most occurring characters

ValueCountFrequency (%)
01664
94.3%
1101
 
5.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01664
94.3%
1101
 
5.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01664
94.3%
1101
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01664
94.3%
1101
 
5.7%

('P20', 'rnns')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1667 
1
 
98

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01667
94.4%
198
 
5.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01667
94.4%
198
 
5.6%

Most occurring characters

ValueCountFrequency (%)
01667
94.4%
198
 
5.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01667
94.4%
198
 
5.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01667
94.4%
198
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01667
94.4%
198
 
5.6%

('P20', 'hmms')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1747 
1
 
18

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01747
99.0%
118
 
1.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01747
99.0%
118
 
1.0%

Most occurring characters

ValueCountFrequency (%)
01747
99.0%
118
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01747
99.0%
118
 
1.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01747
99.0%
118
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01747
99.0%
118
 
1.0%

('P20', 'gans')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1741 
1
 
24

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01741
98.6%
124
 
1.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01741
98.6%
124
 
1.4%

Most occurring characters

ValueCountFrequency (%)
01741
98.6%
124
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01741
98.6%
124
 
1.4%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01741
98.6%
124
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01741
98.6%
124
 
1.4%

('P20', 'markov_chains')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1686 
1
 
79

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01686
95.5%
179
 
4.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01686
95.5%
179
 
4.5%

Most occurring characters

ValueCountFrequency (%)
01686
95.5%
179
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01686
95.5%
179
 
4.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01686
95.5%
179
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01686
95.5%
179
 
4.5%

('P20', 'nlp')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1513 
1
252 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01513
85.7%
1252
 
14.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01513
85.7%
1252
 
14.3%

Most occurring characters

ValueCountFrequency (%)
01513
85.7%
1252
 
14.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01513
85.7%
1252
 
14.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01513
85.7%
1252
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01513
85.7%
1252
 
14.3%

('P20', 'gradient_boosted_machines')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1588 
1
177 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01588
90.0%
1177
 
10.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01588
90.0%
1177
 
10.0%

Most occurring characters

ValueCountFrequency (%)
01588
90.0%
1177
 
10.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01588
90.0%
1177
 
10.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01588
90.0%
1177
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01588
90.0%
1177
 
10.0%

('P20', 'cluster_analysis')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1397 
1
368 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01397
79.2%
1368
 
20.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01397
79.2%
1368
 
20.8%

Most occurring characters

ValueCountFrequency (%)
01397
79.2%
1368
 
20.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01397
79.2%
1368
 
20.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01397
79.2%
1368
 
20.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01397
79.2%
1368
 
20.8%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1693 
1
 
72

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01693
95.9%
172
 
4.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01693
95.9%
172
 
4.1%

Most occurring characters

ValueCountFrequency (%)
01693
95.9%
172
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01693
95.9%
172
 
4.1%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01693
95.9%
172
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01693
95.9%
172
 
4.1%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1664 
1
 
101

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01664
94.3%
1101
 
5.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01664
94.3%
1101
 
5.7%

Most occurring characters

ValueCountFrequency (%)
01664
94.3%
1101
 
5.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01664
94.3%
1101
 
5.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01664
94.3%
1101
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01664
94.3%
1101
 
5.7%

('P20', 'joint analysis')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1686 
1
 
79

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01686
95.5%
179
 
4.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01686
95.5%
179
 
4.5%

Most occurring characters

ValueCountFrequency (%)
01686
95.5%
179
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01686
95.5%
179
 
4.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01686
95.5%
179
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01686
95.5%
179
 
4.5%

('P20', 'no_listed_methods')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1573 
1
192 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01573
89.1%
1192
 
10.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01573
89.1%
1192
 
10.9%

Most occurring characters

ValueCountFrequency (%)
01573
89.1%
1192
 
10.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01573
89.1%
1192
 
10.9%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01573
89.1%
1192
 
10.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01573
89.1%
1192
 
10.9%

('P21', 'sql_')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1051 
1
714 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row1
5th row1

Common Values

ValueCountFrequency (%)
01051
59.5%
1714
40.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01051
59.5%
1714
40.5%

Most occurring characters

ValueCountFrequency (%)
01051
59.5%
1714
40.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01051
59.5%
1714
40.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01051
59.5%
1714
40.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01051
59.5%
1714
40.5%

('P21', 'r')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1447 
1
318 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01447
82.0%
1318
 
18.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01447
82.0%
1318
 
18.0%

Most occurring characters

ValueCountFrequency (%)
01447
82.0%
1318
 
18.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01447
82.0%
1318
 
18.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01447
82.0%
1318
 
18.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01447
82.0%
1318
 
18.0%

('P21', 'python')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
981 
1
784 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
0981
55.6%
1784
44.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
0981
55.6%
1784
44.4%

Most occurring characters

ValueCountFrequency (%)
0981
55.6%
1784
44.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0981
55.6%
1784
44.4%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0981
55.6%
1784
44.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0981
55.6%
1784
44.4%

('P21', 'c_c++_c#')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1700 
1
 
65

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01700
96.3%
165
 
3.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01700
96.3%
165
 
3.7%

Most occurring characters

ValueCountFrequency (%)
01700
96.3%
165
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01700
96.3%
165
 
3.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01700
96.3%
165
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01700
96.3%
165
 
3.7%

('P21', 'dotnet')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1730 
1
 
35

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01730
98.0%
135
 
2.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01730
98.0%
135
 
2.0%

Most occurring characters

ValueCountFrequency (%)
01730
98.0%
135
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01730
98.0%
135
 
2.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01730
98.0%
135
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01730
98.0%
135
 
2.0%

('P21', 'java')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1676 
1
 
89

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01676
95.0%
189
 
5.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01676
95.0%
189
 
5.0%

Most occurring characters

ValueCountFrequency (%)
01676
95.0%
189
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01676
95.0%
189
 
5.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01676
95.0%
189
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01676
95.0%
189
 
5.0%

('P21', 'julia')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1751 
1
 
14

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01751
99.2%
114
 
0.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01751
99.2%
114
 
0.8%

Most occurring characters

ValueCountFrequency (%)
01751
99.2%
114
 
0.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01751
99.2%
114
 
0.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01751
99.2%
114
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01751
99.2%
114
 
0.8%

('P21', 'sas_stata')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1704 
1
 
61

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01704
96.5%
161
 
3.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01704
96.5%
161
 
3.5%

Most occurring characters

ValueCountFrequency (%)
01704
96.5%
161
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01704
96.5%
161
 
3.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01704
96.5%
161
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01704
96.5%
161
 
3.5%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1681 
1
 
84

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01681
95.2%
184
 
4.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01681
95.2%
184
 
4.8%

Most occurring characters

ValueCountFrequency (%)
01681
95.2%
184
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01681
95.2%
184
 
4.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01681
95.2%
184
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01681
95.2%
184
 
4.8%

('P21', 'scala')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1703 
1
 
62

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01703
96.5%
162
 
3.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01703
96.5%
162
 
3.5%

Most occurring characters

ValueCountFrequency (%)
01703
96.5%
162
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01703
96.5%
162
 
3.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01703
96.5%
162
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01703
96.5%
162
 
3.5%

('P21', 'matlab')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1736 
1
 
29

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01736
98.4%
129
 
1.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01736
98.4%
129
 
1.6%

Most occurring characters

ValueCountFrequency (%)
01736
98.4%
129
 
1.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01736
98.4%
129
 
1.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01736
98.4%
129
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01736
98.4%
129
 
1.6%

('P21', 'php')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1717 
1
 
48

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01717
97.3%
148
 
2.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01717
97.3%
148
 
2.7%

Most occurring characters

ValueCountFrequency (%)
01717
97.3%
148
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01717
97.3%
148
 
2.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01717
97.3%
148
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01717
97.3%
148
 
2.7%

('P21', 'no_listed_languages')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1757 
1
 
8

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01757
99.5%
18
 
0.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01757
99.5%
18
 
0.5%

Most occurring characters

ValueCountFrequency (%)
01757
99.5%
18
 
0.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01757
99.5%
18
 
0.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01757
99.5%
18
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01757
99.5%
18
 
0.5%

('P22', 'most_used_proggraming_languages')
Categorical

HIGH CORRELATION
MISSING

Distinct9
Distinct (%)1.0%
Missing859
Missing (%)48.7%
Memory size13.9 KiB
Python
459 
SQL
279 
R
91 
SAS/Stata
 
19
Java
 
17
Other values (4)
 
41

Length

Max length43
Median length6
Mean length5.149006623
Min length1

Characters and Unicode

Total characters4665
Distinct characters33
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPython
2nd rowPython
3rd rowSQL
4th rowPython
5th rowJava

Common Values

ValueCountFrequency (%)
Python459
26.0%
SQL279
 
15.8%
R91
 
5.2%
SAS/Stata19
 
1.1%
Java17
 
1.0%
Visual Basic/VBA12
 
0.7%
Scala10
 
0.6%
Não utilizo nenhuma das linguagens listadas10
 
0.6%
C/C++/C#9
 
0.5%
(Missing)859
48.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
python459
47.4%
sql279
28.8%
r91
 
9.4%
sas/stata19
 
2.0%
java17
 
1.8%
visual12
 
1.2%
basic/vba12
 
1.2%
scala10
 
1.0%
não10
 
1.0%
utilizo10
 
1.0%
Other values (5)49
 
5.1%

Most occurring characters

ValueCountFrequency (%)
t517
11.1%
n499
10.7%
o479
10.3%
h469
10.1%
P459
9.8%
y459
9.8%
S346
7.4%
Q279
 
6.0%
L279
 
6.0%
a166
 
3.6%
Other values (23)713
15.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2940
63.0%
Uppercase Letter1587
34.0%
Space Separator62
 
1.3%
Other Punctuation58
 
1.2%
Math Symbol18
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t517
17.6%
n499
17.0%
o479
16.3%
h469
16.0%
y459
15.6%
a166
 
5.6%
s64
 
2.2%
i64
 
2.2%
l52
 
1.8%
u42
 
1.4%
Other values (8)129
 
4.4%
Uppercase Letter
ValueCountFrequency (%)
P459
28.9%
S346
21.8%
Q279
17.6%
L279
17.6%
R91
 
5.7%
A31
 
2.0%
C27
 
1.7%
V24
 
1.5%
B24
 
1.5%
J17
 
1.1%
Other Punctuation
ValueCountFrequency (%)
/49
84.5%
#9
 
15.5%
Space Separator
ValueCountFrequency (%)
62
100.0%
Math Symbol
ValueCountFrequency (%)
+18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin4527
97.0%
Common138
 
3.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
t517
11.4%
n499
11.0%
o479
10.6%
h469
10.4%
P459
10.1%
y459
10.1%
S346
7.6%
Q279
6.2%
L279
6.2%
a166
 
3.7%
Other values (19)575
12.7%
Common
ValueCountFrequency (%)
62
44.9%
/49
35.5%
+18
 
13.0%
#9
 
6.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII4655
99.8%
None10
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t517
11.1%
n499
10.7%
o479
10.3%
h469
10.1%
P459
9.9%
y459
9.9%
S346
7.4%
Q279
 
6.0%
L279
 
6.0%
a166
 
3.6%
Other values (22)703
15.1%
None
ValueCountFrequency (%)
ã10
100.0%

('P23', 'sql')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
931 
1
834 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row1
5th row1

Common Values

ValueCountFrequency (%)
0931
52.7%
1834
47.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
0931
52.7%
1834
47.3%

Most occurring characters

ValueCountFrequency (%)
0931
52.7%
1834
47.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0931
52.7%
1834
47.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0931
52.7%
1834
47.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0931
52.7%
1834
47.3%

('P23', 'nosql')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1388 
1
377 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01388
78.6%
1377
 
21.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01388
78.6%
1377
 
21.4%

Most occurring characters

ValueCountFrequency (%)
01388
78.6%
1377
 
21.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01388
78.6%
1377
 
21.4%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01388
78.6%
1377
 
21.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01388
78.6%
1377
 
21.4%

('P23', 'images')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1605 
1
 
160

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01605
90.9%
1160
 
9.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01605
90.9%
1160
 
9.1%

Most occurring characters

ValueCountFrequency (%)
01605
90.9%
1160
 
9.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01605
90.9%
1160
 
9.1%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01605
90.9%
1160
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01605
90.9%
1160
 
9.1%

('P23', 'nlp')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1435 
1
330 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01435
81.3%
1330
 
18.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01435
81.3%
1330
 
18.7%

Most occurring characters

ValueCountFrequency (%)
01435
81.3%
1330
 
18.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01435
81.3%
1330
 
18.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01435
81.3%
1330
 
18.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01435
81.3%
1330
 
18.7%

('P23', 'videos')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1729 
1
 
36

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Most occurring characters

ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

('P23', 'sheets')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1007 
1
758 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01007
57.1%
1758
42.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01007
57.1%
1758
42.9%

Most occurring characters

ValueCountFrequency (%)
01007
57.1%
1758
42.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01007
57.1%
1758
42.9%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01007
57.1%
1758
42.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01007
57.1%
1758
42.9%

('P23', 'other')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1730 
1
 
35

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01730
98.0%
135
 
2.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01730
98.0%
135
 
2.0%

Most occurring characters

ValueCountFrequency (%)
01730
98.0%
135
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01730
98.0%
135
 
2.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01730
98.0%
135
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01730
98.0%
135
 
2.0%

('P24', 'sql')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1084 
1
681 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row1
5th row0

Common Values

ValueCountFrequency (%)
01084
61.4%
1681
38.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01084
61.4%
1681
38.6%

Most occurring characters

ValueCountFrequency (%)
01084
61.4%
1681
38.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01084
61.4%
1681
38.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01084
61.4%
1681
38.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01084
61.4%
1681
38.6%

('P24', 'nosql')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1629 
1
 
136

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01629
92.3%
1136
 
7.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01629
92.3%
1136
 
7.7%

Most occurring characters

ValueCountFrequency (%)
01629
92.3%
1136
 
7.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01629
92.3%
1136
 
7.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01629
92.3%
1136
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01629
92.3%
1136
 
7.7%

('P24', 'imagens')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1743 
1
 
22

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01743
98.8%
122
 
1.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01743
98.8%
122
 
1.2%

Most occurring characters

ValueCountFrequency (%)
01743
98.8%
122
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01743
98.8%
122
 
1.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01743
98.8%
122
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01743
98.8%
122
 
1.2%

('P24', 'nlp')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1681 
1
 
84

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01681
95.2%
184
 
4.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01681
95.2%
184
 
4.8%

Most occurring characters

ValueCountFrequency (%)
01681
95.2%
184
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01681
95.2%
184
 
4.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01681
95.2%
184
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01681
95.2%
184
 
4.8%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1760 
1
 
5

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Most occurring characters

ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

('P24', 'planilhas')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1381 
1
384 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01381
78.2%
1384
 
21.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01381
78.2%
1384
 
21.8%

Most occurring characters

ValueCountFrequency (%)
01381
78.2%
1384
 
21.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01381
78.2%
1384
 
21.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01381
78.2%
1384
 
21.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01381
78.2%
1384
 
21.8%

('P24', 'other')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1755 
1
 
10

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01755
99.4%
110
 
0.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01755
99.4%
110
 
0.6%

Most occurring characters

ValueCountFrequency (%)
01755
99.4%
110
 
0.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01755
99.4%
110
 
0.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01755
99.4%
110
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01755
99.4%
110
 
0.6%

('P25', 'aws')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1353 
1
412 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row1
5th row0

Common Values

ValueCountFrequency (%)
01353
76.7%
1412
 
23.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01353
76.7%
1412
 
23.3%

Most occurring characters

ValueCountFrequency (%)
01353
76.7%
1412
 
23.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01353
76.7%
1412
 
23.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01353
76.7%
1412
 
23.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01353
76.7%
1412
 
23.3%

('P25', 'gcp')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1540 
1
225 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01540
87.3%
1225
 
12.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01540
87.3%
1225
 
12.7%

Most occurring characters

ValueCountFrequency (%)
01540
87.3%
1225
 
12.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01540
87.3%
1225
 
12.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01540
87.3%
1225
 
12.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01540
87.3%
1225
 
12.7%

('P25', 'azure')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1574 
1
191 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01574
89.2%
1191
 
10.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01574
89.2%
1191
 
10.8%

Most occurring characters

ValueCountFrequency (%)
01574
89.2%
1191
 
10.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01574
89.2%
1191
 
10.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01574
89.2%
1191
 
10.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01574
89.2%
1191
 
10.8%

('P25', 'ibm')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1721 
1
 
44

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01721
97.5%
144
 
2.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01721
97.5%
144
 
2.5%

Most occurring characters

ValueCountFrequency (%)
01721
97.5%
144
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01721
97.5%
144
 
2.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01721
97.5%
144
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01721
97.5%
144
 
2.5%

('P25', 'on_premise_servers')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1613 
1
 
152

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01613
91.4%
1152
 
8.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01613
91.4%
1152
 
8.6%

Most occurring characters

ValueCountFrequency (%)
01613
91.4%
1152
 
8.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01613
91.4%
1152
 
8.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01613
91.4%
1152
 
8.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01613
91.4%
1152
 
8.6%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1602 
1
163 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01602
90.8%
1163
 
9.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01602
90.8%
1163
 
9.2%

Most occurring characters

ValueCountFrequency (%)
01602
90.8%
1163
 
9.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01602
90.8%
1163
 
9.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01602
90.8%
1163
 
9.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01602
90.8%
1163
 
9.2%

('P25', 'other')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1696 
1
 
69

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01696
96.1%
169
 
3.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01696
96.1%
169
 
3.9%

Most occurring characters

ValueCountFrequency (%)
01696
96.1%
169
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01696
96.1%
169
 
3.9%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01696
96.1%
169
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01696
96.1%
169
 
3.9%

('P26', 'mysql')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1399 
1
366 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01399
79.3%
1366
 
20.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01399
79.3%
1366
 
20.7%

Most occurring characters

ValueCountFrequency (%)
01399
79.3%
1366
 
20.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01399
79.3%
1366
 
20.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01399
79.3%
1366
 
20.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01399
79.3%
1366
 
20.7%

('P26', 'oracle')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1544 
1
221 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01544
87.5%
1221
 
12.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01544
87.5%
1221
 
12.5%

Most occurring characters

ValueCountFrequency (%)
01544
87.5%
1221
 
12.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01544
87.5%
1221
 
12.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01544
87.5%
1221
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01544
87.5%
1221
 
12.5%

('P26', 'sql_server')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1431 
1
334 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01431
81.1%
1334
 
18.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01431
81.1%
1334
 
18.9%

Most occurring characters

ValueCountFrequency (%)
01431
81.1%
1334
 
18.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01431
81.1%
1334
 
18.9%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01431
81.1%
1334
 
18.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01431
81.1%
1334
 
18.9%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1742 
1
 
23

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01742
98.7%
123
 
1.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01742
98.7%
123
 
1.3%

Most occurring characters

ValueCountFrequency (%)
01742
98.7%
123
 
1.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01742
98.7%
123
 
1.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01742
98.7%
123
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01742
98.7%
123
 
1.3%

('P26', 'dynamodb')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1724 
1
 
41

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01724
97.7%
141
 
2.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01724
97.7%
141
 
2.3%

Most occurring characters

ValueCountFrequency (%)
01724
97.7%
141
 
2.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01724
97.7%
141
 
2.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01724
97.7%
141
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01724
97.7%
141
 
2.3%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1760 
1
 
5

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Most occurring characters

ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01760
99.7%
15
 
0.3%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1738 
1
 
27

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01738
98.5%
127
 
1.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01738
98.5%
127
 
1.5%

Most occurring characters

ValueCountFrequency (%)
01738
98.5%
127
 
1.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01738
98.5%
127
 
1.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01738
98.5%
127
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01738
98.5%
127
 
1.5%

('P26', 'mongodb')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1558 
1
207 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01558
88.3%
1207
 
11.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01558
88.3%
1207
 
11.7%

Most occurring characters

ValueCountFrequency (%)
01558
88.3%
1207
 
11.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01558
88.3%
1207
 
11.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01558
88.3%
1207
 
11.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01558
88.3%
1207
 
11.7%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1701 
1
 
64

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01701
96.4%
164
 
3.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01701
96.4%
164
 
3.6%

Most occurring characters

ValueCountFrequency (%)
01701
96.4%
164
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01701
96.4%
164
 
3.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01701
96.4%
164
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01701
96.4%
164
 
3.6%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1762 
1
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01762
99.8%
13
 
0.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01762
99.8%
13
 
0.2%

Most occurring characters

ValueCountFrequency (%)
01762
99.8%
13
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01762
99.8%
13
 
0.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01762
99.8%
13
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01762
99.8%
13
 
0.2%

('P26', 's3')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1576 
1
189 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01576
89.3%
1189
 
10.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01576
89.3%
1189
 
10.7%

Most occurring characters

ValueCountFrequency (%)
01576
89.3%
1189
 
10.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01576
89.3%
1189
 
10.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01576
89.3%
1189
 
10.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01576
89.3%
1189
 
10.7%

('P26', 'postgresql')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1420 
1
345 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01420
80.5%
1345
 
19.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01420
80.5%
1345
 
19.5%

Most occurring characters

ValueCountFrequency (%)
01420
80.5%
1345
 
19.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01420
80.5%
1345
 
19.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01420
80.5%
1345
 
19.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01420
80.5%
1345
 
19.5%

('P26', 'elaticsearch')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1650 
1
 
115

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01650
93.5%
1115
 
6.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01650
93.5%
1115
 
6.5%

Most occurring characters

ValueCountFrequency (%)
01650
93.5%
1115
 
6.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01650
93.5%
1115
 
6.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01650
93.5%
1115
 
6.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01650
93.5%
1115
 
6.5%

('P26', 'db2')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1726 
1
 
39

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01726
97.8%
139
 
2.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01726
97.8%
139
 
2.2%

Most occurring characters

ValueCountFrequency (%)
01726
97.8%
139
 
2.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01726
97.8%
139
 
2.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01726
97.8%
139
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01726
97.8%
139
 
2.2%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1731 
1
 
34

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01731
98.1%
134
 
1.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01731
98.1%
134
 
1.9%

Most occurring characters

ValueCountFrequency (%)
01731
98.1%
134
 
1.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01731
98.1%
134
 
1.9%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01731
98.1%
134
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01731
98.1%
134
 
1.9%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1663 
1
 
102

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01663
94.2%
1102
 
5.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01663
94.2%
1102
 
5.8%

Most occurring characters

ValueCountFrequency (%)
01663
94.2%
1102
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01663
94.2%
1102
 
5.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01663
94.2%
1102
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01663
94.2%
1102
 
5.8%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1747 
1
 
18

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01747
99.0%
118
 
1.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01747
99.0%
118
 
1.0%

Most occurring characters

ValueCountFrequency (%)
01747
99.0%
118
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01747
99.0%
118
 
1.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01747
99.0%
118
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01747
99.0%
118
 
1.0%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1725 
1
 
40

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01725
97.7%
140
 
2.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01725
97.7%
140
 
2.3%

Most occurring characters

ValueCountFrequency (%)
01725
97.7%
140
 
2.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01725
97.7%
140
 
2.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01725
97.7%
140
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01725
97.7%
140
 
2.3%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1760 
1
 
5

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Most occurring characters

ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01760
99.7%
15
 
0.3%

('P26', 'redis')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1721 
1
 
44

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01721
97.5%
144
 
2.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01721
97.5%
144
 
2.5%

Most occurring characters

ValueCountFrequency (%)
01721
97.5%
144
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01721
97.5%
144
 
2.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01721
97.5%
144
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01721
97.5%
144
 
2.5%

('P26', 'neo4j')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1744 
1
 
21

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01744
98.8%
121
 
1.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01744
98.8%
121
 
1.2%

Most occurring characters

ValueCountFrequency (%)
01744
98.8%
121
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01744
98.8%
121
 
1.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01744
98.8%
121
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01744
98.8%
121
 
1.2%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1729 
1
 
36

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Most occurring characters

ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

('P26', 'hbase')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1732 
1
 
33

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01732
98.1%
133
 
1.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01732
98.1%
133
 
1.9%

Most occurring characters

ValueCountFrequency (%)
01732
98.1%
133
 
1.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01732
98.1%
133
 
1.9%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01732
98.1%
133
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01732
98.1%
133
 
1.9%

('P26', 'other')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1671 
1
 
94

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row1
5th row0

Common Values

ValueCountFrequency (%)
01671
94.7%
194
 
5.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01671
94.7%
194
 
5.3%

Most occurring characters

ValueCountFrequency (%)
01671
94.7%
194
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01671
94.7%
194
 
5.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01671
94.7%
194
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01671
94.7%
194
 
5.3%

('P27', 'microsoft_powerbi')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1372 
1
393 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01372
77.7%
1393
 
22.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01372
77.7%
1393
 
22.3%

Most occurring characters

ValueCountFrequency (%)
01372
77.7%
1393
 
22.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01372
77.7%
1393
 
22.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01372
77.7%
1393
 
22.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01372
77.7%
1393
 
22.3%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1670 
1
 
95

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01670
94.6%
195
 
5.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01670
94.6%
195
 
5.4%

Most occurring characters

ValueCountFrequency (%)
01670
94.6%
195
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01670
94.6%
195
 
5.4%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01670
94.6%
195
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01670
94.6%
195
 
5.4%

('P27', 'tableau')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1556 
1
209 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01556
88.2%
1209
 
11.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01556
88.2%
1209
 
11.8%

Most occurring characters

ValueCountFrequency (%)
01556
88.2%
1209
 
11.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01556
88.2%
1209
 
11.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01556
88.2%
1209
 
11.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01556
88.2%
1209
 
11.8%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1621 
1
 
144

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01621
91.8%
1144
 
8.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01621
91.8%
1144
 
8.2%

Most occurring characters

ValueCountFrequency (%)
01621
91.8%
1144
 
8.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01621
91.8%
1144
 
8.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01621
91.8%
1144
 
8.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01621
91.8%
1144
 
8.2%

('P27', 'superset')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1748 
1
 
17

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01748
99.0%
117
 
1.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01748
99.0%
117
 
1.0%

Most occurring characters

ValueCountFrequency (%)
01748
99.0%
117
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01748
99.0%
117
 
1.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01748
99.0%
117
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01748
99.0%
117
 
1.0%

('P27', 'redash')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1739 
1
 
26

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01739
98.5%
126
 
1.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01739
98.5%
126
 
1.5%

Most occurring characters

ValueCountFrequency (%)
01739
98.5%
126
 
1.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01739
98.5%
126
 
1.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01739
98.5%
126
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01739
98.5%
126
 
1.5%

('P27', 'microstrategy')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1738 
1
 
27

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01738
98.5%
127
 
1.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01738
98.5%
127
 
1.5%

Most occurring characters

ValueCountFrequency (%)
01738
98.5%
127
 
1.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01738
98.5%
127
 
1.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01738
98.5%
127
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01738
98.5%
127
 
1.5%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1745 
1
 
20

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01745
98.9%
120
 
1.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01745
98.9%
120
 
1.1%

Most occurring characters

ValueCountFrequency (%)
01745
98.9%
120
 
1.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01745
98.9%
120
 
1.1%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01745
98.9%
120
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01745
98.9%
120
 
1.1%

('P27', 'sap_business_objects')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1748 
1
 
17

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01748
99.0%
117
 
1.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01748
99.0%
117
 
1.0%

Most occurring characters

ValueCountFrequency (%)
01748
99.0%
117
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01748
99.0%
117
 
1.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01748
99.0%
117
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01748
99.0%
117
 
1.0%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1755 
1
 
10

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01755
99.4%
110
 
0.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01755
99.4%
110
 
0.6%

Most occurring characters

ValueCountFrequency (%)
01755
99.4%
110
 
0.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01755
99.4%
110
 
0.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01755
99.4%
110
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01755
99.4%
110
 
0.6%

('P27', 'birst')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1763 
1
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01763
99.9%
12
 
0.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01763
99.9%
12
 
0.1%

Most occurring characters

ValueCountFrequency (%)
01763
99.9%
12
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01763
99.9%
12
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01763
99.9%
12
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01763
99.9%
12
 
0.1%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1750 
1
 
15

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01750
99.2%
115
 
0.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01750
99.2%
115
 
0.8%

Most occurring characters

ValueCountFrequency (%)
01750
99.2%
115
 
0.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01750
99.2%
115
 
0.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01750
99.2%
115
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01750
99.2%
115
 
0.8%

('P27', 'google_data_studio')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1636 
1
 
129

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01636
92.7%
1129
 
7.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01636
92.7%
1129
 
7.3%

Most occurring characters

ValueCountFrequency (%)
01636
92.7%
1129
 
7.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01636
92.7%
1129
 
7.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01636
92.7%
1129
 
7.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01636
92.7%
1129
 
7.3%

('P27', 'only_excel_gsheets')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1660 
1
 
105

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01660
94.1%
1105
 
5.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01660
94.1%
1105
 
5.9%

Most occurring characters

ValueCountFrequency (%)
01660
94.1%
1105
 
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01660
94.1%
1105
 
5.9%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01660
94.1%
1105
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01660
94.1%
1105
 
5.9%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1586 
1
179 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01586
89.9%
1179
 
10.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01586
89.9%
1179
 
10.1%

Most occurring characters

ValueCountFrequency (%)
01586
89.9%
1179
 
10.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01586
89.9%
1179
 
10.1%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01586
89.9%
1179
 
10.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01586
89.9%
1179
 
10.1%

('P27', 'other')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1682 
1
 
83

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row1
5th row0

Common Values

ValueCountFrequency (%)
01682
95.3%
183
 
4.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01682
95.3%
183
 
4.7%

Most occurring characters

ValueCountFrequency (%)
01682
95.3%
183
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01682
95.3%
183
 
4.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01682
95.3%
183
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01682
95.3%
183
 
4.7%

('P28', 'sql_&_stored_procedures')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1472 
1
293 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01472
83.4%
1293
 
16.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01472
83.4%
1293
 
16.6%

Most occurring characters

ValueCountFrequency (%)
01472
83.4%
1293
 
16.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01472
83.4%
1293
 
16.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01472
83.4%
1293
 
16.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01472
83.4%
1293
 
16.6%

('P28', 'apache_airflow')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1609 
1
 
156

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01609
91.2%
1156
 
8.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01609
91.2%
1156
 
8.8%

Most occurring characters

ValueCountFrequency (%)
01609
91.2%
1156
 
8.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01609
91.2%
1156
 
8.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01609
91.2%
1156
 
8.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01609
91.2%
1156
 
8.8%

('P28', 'luigi')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1750 
1
 
15

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01750
99.2%
115
 
0.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01750
99.2%
115
 
0.8%

Most occurring characters

ValueCountFrequency (%)
01750
99.2%
115
 
0.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01750
99.2%
115
 
0.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01750
99.2%
115
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01750
99.2%
115
 
0.8%

('P28', 'aws_glue')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1661 
1
 
104

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01661
94.1%
1104
 
5.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01661
94.1%
1104
 
5.9%

Most occurring characters

ValueCountFrequency (%)
01661
94.1%
1104
 
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01661
94.1%
1104
 
5.9%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01661
94.1%
1104
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01661
94.1%
1104
 
5.9%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1723 
1
 
42

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01723
97.6%
142
 
2.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01723
97.6%
142
 
2.4%

Most occurring characters

ValueCountFrequency (%)
01723
97.6%
142
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01723
97.6%
142
 
2.4%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01723
97.6%
142
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01723
97.6%
142
 
2.4%

('P28', 'pentaho')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1595 
1
170 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row1
5th row0

Common Values

ValueCountFrequency (%)
01595
90.4%
1170
 
9.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01595
90.4%
1170
 
9.6%

Most occurring characters

ValueCountFrequency (%)
01595
90.4%
1170
 
9.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01595
90.4%
1170
 
9.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01595
90.4%
1170
 
9.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01595
90.4%
1170
 
9.6%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1728 
1
 
37

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01728
97.9%
137
 
2.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01728
97.9%
137
 
2.1%

Most occurring characters

ValueCountFrequency (%)
01728
97.9%
137
 
2.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01728
97.9%
137
 
2.1%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01728
97.9%
137
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01728
97.9%
137
 
2.1%

('P28', 'oracle_data_integrator')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1728 
1
 
37

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01728
97.9%
137
 
2.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01728
97.9%
137
 
2.1%

Most occurring characters

ValueCountFrequency (%)
01728
97.9%
137
 
2.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01728
97.9%
137
 
2.1%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01728
97.9%
137
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01728
97.9%
137
 
2.1%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1726 
1
 
39

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01726
97.8%
139
 
2.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01726
97.8%
139
 
2.2%

Most occurring characters

ValueCountFrequency (%)
01726
97.8%
139
 
2.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01726
97.8%
139
 
2.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01726
97.8%
139
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01726
97.8%
139
 
2.2%

('P28', 'sap_bw_etl')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1741 
1
 
24

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01741
98.6%
124
 
1.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01741
98.6%
124
 
1.4%

Most occurring characters

ValueCountFrequency (%)
01741
98.6%
124
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01741
98.6%
124
 
1.4%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01741
98.6%
124
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01741
98.6%
124
 
1.4%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1639 
1
 
126

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01639
92.9%
1126
 
7.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01639
92.9%
1126
 
7.1%

Most occurring characters

ValueCountFrequency (%)
01639
92.9%
1126
 
7.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01639
92.9%
1126
 
7.1%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01639
92.9%
1126
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01639
92.9%
1126
 
7.1%

('P28', 'other')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1560 
1
205 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01560
88.4%
1205
 
11.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01560
88.4%
1205
 
11.6%

Most occurring characters

ValueCountFrequency (%)
01560
88.4%
1205
 
11.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01560
88.4%
1205
 
11.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01560
88.4%
1205
 
11.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01560
88.4%
1205
 
11.6%

('P29', 'have_data_warehouse')
Categorical

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)0.3%
Missing972
Missing (%)55.1%
Memory size13.9 KiB
1.0
461 
0.0
332 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters2379
Distinct characters3
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.0
2nd row0.0
3rd row1.0
4th row0.0
5th row0.0

Common Values

ValueCountFrequency (%)
1.0461
26.1%
0.0332
 
18.8%
(Missing)972
55.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
1.0461
58.1%
0.0332
41.9%

Most occurring characters

ValueCountFrequency (%)
01125
47.3%
.793
33.3%
1461
19.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1586
66.7%
Other Punctuation793
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01125
70.9%
1461
29.1%
Other Punctuation
ValueCountFrequency (%)
.793
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common2379
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01125
47.3%
.793
33.3%
1461
19.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII2379
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01125
47.3%
.793
33.3%
1461
19.4%

('P30', 'google_bigquery')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1711 
1
 
54

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01711
96.9%
154
 
3.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01711
96.9%
154
 
3.1%

Most occurring characters

ValueCountFrequency (%)
01711
96.9%
154
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01711
96.9%
154
 
3.1%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01711
96.9%
154
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01711
96.9%
154
 
3.1%

('P30', 'aws_redshift')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1673 
1
 
92

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row1
5th row0

Common Values

ValueCountFrequency (%)
01673
94.8%
192
 
5.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01673
94.8%
192
 
5.2%

Most occurring characters

ValueCountFrequency (%)
01673
94.8%
192
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01673
94.8%
192
 
5.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01673
94.8%
192
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01673
94.8%
192
 
5.2%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1751 
1
 
14

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01751
99.2%
114
 
0.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01751
99.2%
114
 
0.8%

Most occurring characters

ValueCountFrequency (%)
01751
99.2%
114
 
0.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01751
99.2%
114
 
0.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01751
99.2%
114
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01751
99.2%
114
 
0.8%

('P30', 'oracle')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1683 
1
 
82

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01683
95.4%
182
 
4.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01683
95.4%
182
 
4.6%

Most occurring characters

ValueCountFrequency (%)
01683
95.4%
182
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01683
95.4%
182
 
4.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01683
95.4%
182
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01683
95.4%
182
 
4.6%

('P30', 'postgres_mysql')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1658 
1
 
107

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01658
93.9%
1107
 
6.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01658
93.9%
1107
 
6.1%

Most occurring characters

ValueCountFrequency (%)
01658
93.9%
1107
 
6.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01658
93.9%
1107
 
6.1%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01658
93.9%
1107
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01658
93.9%
1107
 
6.1%

('P30', 'ibm')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1745 
1
 
20

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01745
98.9%
120
 
1.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01745
98.9%
120
 
1.1%

Most occurring characters

ValueCountFrequency (%)
01745
98.9%
120
 
1.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01745
98.9%
120
 
1.1%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01745
98.9%
120
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01745
98.9%
120
 
1.1%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1737 
1
 
28

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01737
98.4%
128
 
1.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01737
98.4%
128
 
1.6%

Most occurring characters

ValueCountFrequency (%)
01737
98.4%
128
 
1.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01737
98.4%
128
 
1.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01737
98.4%
128
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01737
98.4%
128
 
1.6%

('P30', 'microsoft_azure')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1680 
1
 
85

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01680
95.2%
185
 
4.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01680
95.2%
185
 
4.8%

Most occurring characters

ValueCountFrequency (%)
01680
95.2%
185
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01680
95.2%
185
 
4.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01680
95.2%
185
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01680
95.2%
185
 
4.8%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1705 
1
 
60

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01705
96.6%
160
 
3.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01705
96.6%
160
 
3.4%

Most occurring characters

ValueCountFrequency (%)
01705
96.6%
160
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01705
96.6%
160
 
3.4%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01705
96.6%
160
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01705
96.6%
160
 
3.4%

('P30', 'other')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1729 
1
 
36

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Most occurring characters

ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01729
98.0%
136
 
2.0%

('P31', 'data_hackers_blog')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
1
1195 
0
570 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
11195
67.7%
0570
32.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
11195
67.7%
0570
32.3%

Most occurring characters

ValueCountFrequency (%)
11195
67.7%
0570
32.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
11195
67.7%
0570
32.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
11195
67.7%
0570
32.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11195
67.7%
0570
32.3%

('P31', 'data_hackers_podcast')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
1
1096 
0
669 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
11096
62.1%
0669
37.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
11096
62.1%
0669
37.9%

Most occurring characters

ValueCountFrequency (%)
11096
62.1%
0669
37.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
11096
62.1%
0669
37.9%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
11096
62.1%
0669
37.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11096
62.1%
0669
37.9%

('P31', 'weekly_newsletter')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
887 
1
878 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0887
50.3%
1878
49.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
0887
50.3%
1878
49.7%

Most occurring characters

ValueCountFrequency (%)
0887
50.3%
1878
49.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0887
50.3%
1878
49.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0887
50.3%
1878
49.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0887
50.3%
1878
49.7%

('P31', 'slack_channel')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1037 
1
728 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01037
58.8%
1728
41.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01037
58.8%
1728
41.2%

Most occurring characters

ValueCountFrequency (%)
01037
58.8%
1728
41.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01037
58.8%
1728
41.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01037
58.8%
1728
41.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01037
58.8%
1728
41.2%

('P31', 'data_hackers_bootcamp')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1697 
1
 
68

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01697
96.1%
168
 
3.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01697
96.1%
168
 
3.9%

Most occurring characters

ValueCountFrequency (%)
01697
96.1%
168
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01697
96.1%
168
 
3.9%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01697
96.1%
168
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01697
96.1%
168
 
3.9%

('P31', 'do_not_know_data_hackers')
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1490 
1
275 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row0
4th row1
5th row0

Common Values

ValueCountFrequency (%)
01490
84.4%
1275
 
15.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01490
84.4%
1275
 
15.6%

Most occurring characters

ValueCountFrequency (%)
01490
84.4%
1275
 
15.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01490
84.4%
1275
 
15.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01490
84.4%
1275
 
15.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01490
84.4%
1275
 
15.6%
Distinct6
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
Podcast do Data Hackers
618 
Blog/Medium do Data Hackers
423 
Ainda não conhecia o Data Hackers
251 
Newsletter Semanal
229 
Canal do Slack
205 

Length

Max length33
Median length27
Mean length23.70878187
Min length14

Characters and Unicode

Total characters41846
Distinct characters30
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAinda não conhecia o Data Hackers
2nd rowAinda não conhecia o Data Hackers
3rd rowNewsletter Semanal
4th rowAinda não conhecia o Data Hackers
5th rowBlog/Medium do Data Hackers

Common Values

ValueCountFrequency (%)
Podcast do Data Hackers618
35.0%
Blog/Medium do Data Hackers423
24.0%
Ainda não conhecia o Data Hackers251
14.2%
Newsletter Semanal229
 
13.0%
Canal do Slack205
 
11.6%
Bootcamp do Data Hackers39
 
2.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
data1331
19.3%
hackers1331
19.3%
do1285
18.6%
podcast618
9.0%
blog/medium423
 
6.1%
ainda251
 
3.6%
não251
 
3.6%
conhecia251
 
3.6%
o251
 
3.6%
newsletter229
 
3.3%
Other values (4)678
9.8%

Most occurring characters

ValueCountFrequency (%)
a6225
14.9%
5134
12.3%
o3157
 
7.5%
e2921
 
7.0%
c2695
 
6.4%
d2577
 
6.2%
t2446
 
5.8%
s2178
 
5.2%
r1560
 
3.7%
k1536
 
3.7%
Other values (20)11417
27.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter31005
74.1%
Uppercase Letter5284
 
12.6%
Space Separator5134
 
12.3%
Other Punctuation423
 
1.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a6225
20.1%
o3157
10.2%
e2921
9.4%
c2695
8.7%
d2577
8.3%
t2446
 
7.9%
s2178
 
7.0%
r1560
 
5.0%
k1536
 
5.0%
l1291
 
4.2%
Other values (9)4419
14.3%
Uppercase Letter
ValueCountFrequency (%)
H1331
25.2%
D1331
25.2%
P618
11.7%
B462
 
8.7%
S434
 
8.2%
M423
 
8.0%
A251
 
4.8%
N229
 
4.3%
C205
 
3.9%
Space Separator
ValueCountFrequency (%)
5134
100.0%
Other Punctuation
ValueCountFrequency (%)
/423
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin36289
86.7%
Common5557
 
13.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
a6225
17.2%
o3157
 
8.7%
e2921
 
8.0%
c2695
 
7.4%
d2577
 
7.1%
t2446
 
6.7%
s2178
 
6.0%
r1560
 
4.3%
k1536
 
4.2%
H1331
 
3.7%
Other values (18)9663
26.6%
Common
ValueCountFrequency (%)
5134
92.4%
/423
 
7.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII41595
99.4%
None251
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a6225
15.0%
5134
12.3%
o3157
 
7.6%
e2921
 
7.0%
c2695
 
6.5%
d2577
 
6.2%
t2446
 
5.9%
s2178
 
5.2%
r1560
 
3.8%
k1536
 
3.7%
Other values (19)11166
26.8%
None
ValueCountFrequency (%)
ã251
100.0%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1289 
1
476 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01289
73.0%
1476
 
27.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01289
73.0%
1476
 
27.0%

Most occurring characters

ValueCountFrequency (%)
01289
73.0%
1476
 
27.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01289
73.0%
1476
 
27.0%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01289
73.0%
1476
 
27.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01289
73.0%
1476
 
27.0%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1380 
1
385 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row1
5th row0

Common Values

ValueCountFrequency (%)
01380
78.2%
1385
 
21.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01380
78.2%
1385
 
21.8%

Most occurring characters

ValueCountFrequency (%)
01380
78.2%
1385
 
21.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01380
78.2%
1385
 
21.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01380
78.2%
1385
 
21.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01380
78.2%
1385
 
21.8%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
1
1014 
0
751 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
11014
57.5%
0751
42.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
11014
57.5%
0751
42.5%

Most occurring characters

ValueCountFrequency (%)
11014
57.5%
0751
42.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
11014
57.5%
0751
42.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
11014
57.5%
0751
42.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11014
57.5%
0751
42.5%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1090 
1
675 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01090
61.8%
1675
38.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01090
61.8%
1675
38.2%

Most occurring characters

ValueCountFrequency (%)
01090
61.8%
1675
38.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01090
61.8%
1675
38.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01090
61.8%
1675
38.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01090
61.8%
1675
38.2%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1672 
1
 
93

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01672
94.7%
193
 
5.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01672
94.7%
193
 
5.3%

Most occurring characters

ValueCountFrequency (%)
01672
94.7%
193
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01672
94.7%
193
 
5.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01672
94.7%
193
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01672
94.7%
193
 
5.3%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1424 
1
341 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01424
80.7%
1341
 
19.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01424
80.7%
1341
 
19.3%

Most occurring characters

ValueCountFrequency (%)
01424
80.7%
1341
 
19.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01424
80.7%
1341
 
19.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01424
80.7%
1341
 
19.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01424
80.7%
1341
 
19.3%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
985 
1
780 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0985
55.8%
1780
44.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
0985
55.8%
1780
44.2%

Most occurring characters

ValueCountFrequency (%)
0985
55.8%
1780
44.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0985
55.8%
1780
44.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0985
55.8%
1780
44.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0985
55.8%
1780
44.2%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1313 
1
452 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row1
5th row0

Common Values

ValueCountFrequency (%)
01313
74.4%
1452
 
25.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01313
74.4%
1452
 
25.6%

Most occurring characters

ValueCountFrequency (%)
01313
74.4%
1452
 
25.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01313
74.4%
1452
 
25.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01313
74.4%
1452
 
25.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01313
74.4%
1452
 
25.6%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1003 
1
762 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01003
56.8%
1762
43.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01003
56.8%
1762
43.2%

Most occurring characters

ValueCountFrequency (%)
01003
56.8%
1762
43.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01003
56.8%
1762
43.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01003
56.8%
1762
43.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01003
56.8%
1762
43.2%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1627 
1
 
138

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01627
92.2%
1138
 
7.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01627
92.2%
1138
 
7.8%

Most occurring characters

ValueCountFrequency (%)
01627
92.2%
1138
 
7.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01627
92.2%
1138
 
7.8%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01627
92.2%
1138
 
7.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01627
92.2%
1138
 
7.8%

('P33', 'other')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1599 
1
166 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01599
90.6%
1166
 
9.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01599
90.6%
1166
 
9.4%

Most occurring characters

ValueCountFrequency (%)
01599
90.6%
1166
 
9.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01599
90.6%
1166
 
9.4%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01599
90.6%
1166
 
9.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01599
90.6%
1166
 
9.4%

('P34', 'udacity')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1238 
1
527 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
01238
70.1%
1527
29.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01238
70.1%
1527
29.9%

Most occurring characters

ValueCountFrequency (%)
01238
70.1%
1527
29.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01238
70.1%
1527
29.9%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01238
70.1%
1527
29.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01238
70.1%
1527
29.9%

('P34', 'coursera')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1072 
1
693 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
01072
60.7%
1693
39.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01072
60.7%
1693
39.3%

Most occurring characters

ValueCountFrequency (%)
01072
60.7%
1693
39.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01072
60.7%
1693
39.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01072
60.7%
1693
39.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01072
60.7%
1693
39.3%

('P34', 'udemy')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
1
1125 
0
640 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
11125
63.7%
0640
36.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
11125
63.7%
0640
36.3%

Most occurring characters

ValueCountFrequency (%)
11125
63.7%
0640
36.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
11125
63.7%
0640
36.3%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
11125
63.7%
0640
36.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11125
63.7%
0640
36.3%

('P34', 'height')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1464 
1
301 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row1
5th row0

Common Values

ValueCountFrequency (%)
01464
82.9%
1301
 
17.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01464
82.9%
1301
 
17.1%

Most occurring characters

ValueCountFrequency (%)
01464
82.9%
1301
 
17.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01464
82.9%
1301
 
17.1%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01464
82.9%
1301
 
17.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01464
82.9%
1301
 
17.1%

('P34', 'edx')
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1471 
1
294 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01471
83.3%
1294
 
16.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01471
83.3%
1294
 
16.7%

Most occurring characters

ValueCountFrequency (%)
01471
83.3%
1294
 
16.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01471
83.3%
1294
 
16.7%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01471
83.3%
1294
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01471
83.3%
1294
 
16.7%

('P34', 'data_camp')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1373 
1
392 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01373
77.8%
1392
 
22.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01373
77.8%
1392
 
22.2%

Most occurring characters

ValueCountFrequency (%)
01373
77.8%
1392
 
22.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01373
77.8%
1392
 
22.2%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01373
77.8%
1392
 
22.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01373
77.8%
1392
 
22.2%

('P34', 'data_quest')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1701 
1
 
64

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01701
96.4%
164
 
3.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01701
96.4%
164
 
3.6%

Most occurring characters

ValueCountFrequency (%)
01701
96.4%
164
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01701
96.4%
164
 
3.6%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01701
96.4%
164
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01701
96.4%
164
 
3.6%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1449 
1
316 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01449
82.1%
1316
 
17.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01449
82.1%
1316
 
17.9%

Most occurring characters

ValueCountFrequency (%)
01449
82.1%
1316
 
17.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01449
82.1%
1316
 
17.9%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01449
82.1%
1316
 
17.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01449
82.1%
1316
 
17.9%

('P34', 'online_courses')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1597 
1
168 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01597
90.5%
1168
 
9.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01597
90.5%
1168
 
9.5%

Most occurring characters

ValueCountFrequency (%)
01597
90.5%
1168
 
9.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01597
90.5%
1168
 
9.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01597
90.5%
1168
 
9.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01597
90.5%
1168
 
9.5%

('P34', 'other')
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
0
1527 
1
238 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1765
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01527
86.5%
1238
 
13.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
01527
86.5%
1238
 
13.5%

Most occurring characters

ValueCountFrequency (%)
01527
86.5%
1238
 
13.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1765
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01527
86.5%
1238
 
13.5%

Most occurring scripts

ValueCountFrequency (%)
Common1765
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01527
86.5%
1238
 
13.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01527
86.5%
1238
 
13.5%

('P35', 'data_science_plataforms_preference')
Categorical

HIGH CORRELATION
MISSING

Distinct9
Distinct (%)0.6%
Missing140
Missing (%)7.9%
Memory size13.9 KiB
Udemy
466 
Coursera
309 
Udacity
238 
Nunca fiz cursos online
162 
DataCamp
162 
Other values (4)
288 

Length

Max length23
Median length12
Mean length8.270769231
Min length3

Characters and Unicode

Total characters13440
Distinct characters28
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNunca fiz cursos online
2nd rowUdemy
3rd rowAlura
4th rowUdemy
5th rowUdemy

Common Values

ValueCountFrequency (%)
Udemy466
26.4%
Coursera309
17.5%
Udacity238
13.5%
Nunca fiz cursos online162
 
9.2%
DataCamp162
 
9.2%
Alura138
 
7.8%
Kaggle Learn74
 
4.2%
edX52
 
2.9%
DataQuest24
 
1.4%
(Missing)140
 
7.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
udemy466
21.3%
coursera309
14.1%
udacity238
10.9%
nunca162
 
7.4%
fiz162
 
7.4%
cursos162
 
7.4%
online162
 
7.4%
datacamp162
 
7.4%
alura138
 
6.3%
kaggle74
 
3.4%
Other values (3)150
 
6.9%

Most occurring characters

ValueCountFrequency (%)
a1529
 
11.4%
e1161
 
8.6%
r992
 
7.4%
u795
 
5.9%
d756
 
5.6%
U704
 
5.2%
y704
 
5.2%
s657
 
4.9%
o633
 
4.7%
m628
 
4.7%
Other values (18)4881
36.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter10995
81.8%
Uppercase Letter1885
 
14.0%
Space Separator560
 
4.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a1529
13.9%
e1161
10.6%
r992
 
9.0%
u795
 
7.2%
d756
 
6.9%
y704
 
6.4%
s657
 
6.0%
o633
 
5.8%
m628
 
5.7%
c562
 
5.1%
Other values (8)2578
23.4%
Uppercase Letter
ValueCountFrequency (%)
U704
37.3%
C471
25.0%
D186
 
9.9%
N162
 
8.6%
A138
 
7.3%
K74
 
3.9%
L74
 
3.9%
X52
 
2.8%
Q24
 
1.3%
Space Separator
ValueCountFrequency (%)
560
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin12880
95.8%
Common560
 
4.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
a1529
 
11.9%
e1161
 
9.0%
r992
 
7.7%
u795
 
6.2%
d756
 
5.9%
U704
 
5.5%
y704
 
5.5%
s657
 
5.1%
o633
 
4.9%
m628
 
4.9%
Other values (17)4321
33.5%
Common
ValueCountFrequency (%)
560
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII13440
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a1529
 
11.4%
e1161
 
8.6%
r992
 
7.4%
u795
 
5.9%
d756
 
5.6%
U704
 
5.2%
y704
 
5.2%
s657
 
4.9%
o633
 
4.7%
m628
 
4.7%
Other values (18)4881
36.3%

('P35', 'other')
Categorical

HIGH CARDINALITY
HIGH CORRELATION
MISSING

Distinct70
Distinct (%)50.0%
Missing1625
Missing (%)92.1%
Memory size13.9 KiB
Data Science Academy
43 
DSA
14 
DataScienceAcademy
 
5
Datascienceacademy
 
4
Data science academy
 
3
Other values (65)
71 

Length

Max length78
Median length58
Mean length15.82857143
Min length3

Characters and Unicode

Total characters2216
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique60 ?
Unique (%)42.9%

Sample

1st rowData Science Academy
2nd rowDigital House
3rd rowCognitive Class.ai
4th rowDSA
5th rowcognitiveclass.ai

Common Values

ValueCountFrequency (%)
Data Science Academy43
 
2.4%
DSA14
 
0.8%
DataScienceAcademy5
 
0.3%
Datascienceacademy4
 
0.2%
Data science academy3
 
0.2%
Minerando Dados3
 
0.2%
Data science Academy2
 
0.1%
Nenhuma2
 
0.1%
data science academy2
 
0.1%
Minerando dados2
 
0.1%
Other values (60)60
 
3.4%
(Missing)1625
92.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
academy59
18.4%
data58
18.1%
science55
17.1%
dsa18
 
5.6%
datascienceacademy10
 
3.1%
dados7
 
2.2%
não7
 
2.2%
minerando7
 
2.2%
tenho4
 
1.2%
nenhuma3
 
0.9%
Other values (76)93
29.0%

Most occurring characters

ValueCountFrequency (%)
a298
13.4%
e266
12.0%
c226
 
10.2%
186
 
8.4%
i126
 
5.7%
d115
 
5.2%
n115
 
5.2%
t97
 
4.4%
D94
 
4.2%
m85
 
3.8%
Other values (41)608
27.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1698
76.6%
Uppercase Letter322
 
14.5%
Space Separator186
 
8.4%
Other Punctuation10
 
0.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a298
17.6%
e266
15.7%
c226
13.3%
i126
7.4%
d115
 
6.8%
n115
 
6.8%
t97
 
5.7%
m85
 
5.0%
y74
 
4.4%
o70
 
4.1%
Other values (19)226
13.3%
Uppercase Letter
ValueCountFrequency (%)
D94
29.2%
A80
24.8%
S78
24.2%
N10
 
3.1%
M9
 
2.8%
I8
 
2.5%
C8
 
2.5%
P6
 
1.9%
L5
 
1.6%
B4
 
1.2%
Other values (9)20
 
6.2%
Other Punctuation
ValueCountFrequency (%)
.9
90.0%
,1
 
10.0%
Space Separator
ValueCountFrequency (%)
186
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin2020
91.2%
Common196
 
8.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
a298
14.8%
e266
13.2%
c226
11.2%
i126
 
6.2%
d115
 
5.7%
n115
 
5.7%
t97
 
4.8%
D94
 
4.7%
m85
 
4.2%
A80
 
4.0%
Other values (38)518
25.6%
Common
ValueCountFrequency (%)
186
94.9%
.9
 
4.6%
,1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII2199
99.2%
None17
 
0.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a298
13.6%
e266
12.1%
c226
 
10.3%
186
 
8.5%
i126
 
5.7%
d115
 
5.2%
n115
 
5.2%
t97
 
4.4%
D94
 
4.3%
m85
 
3.9%
Other values (36)591
26.9%
None
ValueCountFrequency (%)
ã9
52.9%
ç3
 
17.6%
ó2
 
11.8%
ê2
 
11.8%
á1
 
5.9%
Distinct2
Distinct (%)0.1%
Missing4
Missing (%)0.2%
Memory size13.9 KiB
1.0
1432 
0.0
329 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters5283
Distinct characters3
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1.0
2nd row0.0
3rd row1.0
4th row1.0
5th row1.0

Common Values

ValueCountFrequency (%)
1.01432
81.1%
0.0329
 
18.6%
(Missing)4
 
0.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
1.01432
81.3%
0.0329
 
18.7%

Most occurring characters

ValueCountFrequency (%)
02090
39.6%
.1761
33.3%
11432
27.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3522
66.7%
Other Punctuation1761
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
02090
59.3%
11432
40.7%
Other Punctuation
ValueCountFrequency (%)
.1761
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common5283
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
02090
39.6%
.1761
33.3%
11432
27.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII5283
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
02090
39.6%
.1761
33.3%
11432
27.1%

('D1', 'living_macroregion')
Categorical

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)0.1%
Missing337
Missing (%)19.1%
Memory size13.9 KiB
Região Sudeste
1159 
Região Sul
269 

Length

Max length14
Median length14
Mean length13.2464986
Min length10

Characters and Unicode

Total characters18916
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowRegião Sudeste
2nd rowRegião Sudeste
3rd rowRegião Sudeste
4th rowRegião Sudeste
5th rowRegião Sul

Common Values

ValueCountFrequency (%)
Região Sudeste1159
65.7%
Região Sul269
 
15.2%
(Missing)337
 
19.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
região1428
50.0%
sudeste1159
40.6%
sul269
 
9.4%

Most occurring characters

ValueCountFrequency (%)
e3746
19.8%
R1428
 
7.5%
g1428
 
7.5%
i1428
 
7.5%
ã1428
 
7.5%
o1428
 
7.5%
1428
 
7.5%
S1428
 
7.5%
u1428
 
7.5%
d1159
 
6.1%
Other values (3)2587
13.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter14632
77.4%
Uppercase Letter2856
 
15.1%
Space Separator1428
 
7.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e3746
25.6%
g1428
 
9.8%
i1428
 
9.8%
ã1428
 
9.8%
o1428
 
9.8%
u1428
 
9.8%
d1159
 
7.9%
s1159
 
7.9%
t1159
 
7.9%
l269
 
1.8%
Uppercase Letter
ValueCountFrequency (%)
R1428
50.0%
S1428
50.0%
Space Separator
ValueCountFrequency (%)
1428
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin17488
92.5%
Common1428
 
7.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e3746
21.4%
R1428
 
8.2%
g1428
 
8.2%
i1428
 
8.2%
ã1428
 
8.2%
o1428
 
8.2%
S1428
 
8.2%
u1428
 
8.2%
d1159
 
6.6%
s1159
 
6.6%
Other values (2)1428
 
8.2%
Common
ValueCountFrequency (%)
1428
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII17488
92.5%
None1428
 
7.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e3746
21.4%
R1428
 
8.2%
g1428
 
8.2%
i1428
 
8.2%
o1428
 
8.2%
1428
 
8.2%
S1428
 
8.2%
u1428
 
8.2%
d1159
 
6.6%
s1159
 
6.6%
Other values (2)1428
 
8.2%
None
ValueCountFrequency (%)
ã1428
100.0%

('D2', 'origin_macroregion')
Categorical

HIGH CORRELATION
MISSING

Distinct5
Distinct (%)1.5%
Missing1440
Missing (%)81.6%
Memory size13.9 KiB
Região Sudeste
154 
Região Nordeste
77 
Região Sul
51 
Região Centro-Oeste
26 
Região Norte
17 

Length

Max length19
Median length15
Mean length13.90461538
Min length10

Characters and Unicode

Total characters4519
Distinct characters19
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowRegião Sudeste
2nd rowRegião Nordeste
3rd rowRegião Nordeste
4th rowRegião Nordeste
5th rowRegião Nordeste

Common Values

ValueCountFrequency (%)
Região Sudeste154
 
8.7%
Região Nordeste77
 
4.4%
Região Sul51
 
2.9%
Região Centro-Oeste26
 
1.5%
Região Norte17
 
1.0%
(Missing)1440
81.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
região325
50.0%
sudeste154
23.7%
nordeste77
 
11.8%
sul51
 
7.8%
centro-oeste26
 
4.0%
norte17
 
2.6%

Most occurring characters

ValueCountFrequency (%)
e882
19.5%
o445
9.8%
R325
 
7.2%
g325
 
7.2%
i325
 
7.2%
ã325
 
7.2%
325
 
7.2%
t300
 
6.6%
s257
 
5.7%
d231
 
5.1%
Other values (9)779
17.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter3492
77.3%
Uppercase Letter676
 
15.0%
Space Separator325
 
7.2%
Dash Punctuation26
 
0.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e882
25.3%
o445
12.7%
g325
 
9.3%
i325
 
9.3%
ã325
 
9.3%
t300
 
8.6%
s257
 
7.4%
d231
 
6.6%
u205
 
5.9%
r120
 
3.4%
Other values (2)77
 
2.2%
Uppercase Letter
ValueCountFrequency (%)
R325
48.1%
S205
30.3%
N94
 
13.9%
C26
 
3.8%
O26
 
3.8%
Space Separator
ValueCountFrequency (%)
325
100.0%
Dash Punctuation
ValueCountFrequency (%)
-26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin4168
92.2%
Common351
 
7.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e882
21.2%
o445
10.7%
R325
 
7.8%
g325
 
7.8%
i325
 
7.8%
ã325
 
7.8%
t300
 
7.2%
s257
 
6.2%
d231
 
5.5%
u205
 
4.9%
Other values (7)548
13.1%
Common
ValueCountFrequency (%)
325
92.6%
-26
 
7.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII4194
92.8%
None325
 
7.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e882
21.0%
o445
10.6%
R325
 
7.7%
g325
 
7.7%
i325
 
7.7%
325
 
7.7%
t300
 
7.2%
s257
 
6.1%
d231
 
5.5%
u205
 
4.9%
Other values (8)574
13.7%
None
ValueCountFrequency (%)
ã325
100.0%

('D3', 'anonymized_degree_area')
Categorical

HIGH CORRELATION
MISSING

Distinct8
Distinct (%)0.5%
Missing35
Missing (%)2.0%
Memory size13.9 KiB
Computação / Engenharia de Software / Sistemas de Informação
1013 
Outras Engenharias
247 
Economia/ Administração / Contabilidade / Finanças
174 
Estatística/ Matemática / Matemática Computacional
104 
Outras
 
92
Other values (3)
 
100

Length

Max length60
Median length60
Mean length47.90520231
Min length6

Characters and Unicode

Total characters82876
Distinct characters38
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowComputação / Engenharia de Software / Sistemas de Informação
2nd rowComputação / Engenharia de Software / Sistemas de Informação
3rd rowOutras Engenharias
4th rowComputação / Engenharia de Software / Sistemas de Informação
5th rowComputação / Engenharia de Software / Sistemas de Informação

Common Values

ValueCountFrequency (%)
Computação / Engenharia de Software / Sistemas de Informação1013
57.4%
Outras Engenharias247
 
14.0%
Economia/ Administração / Contabilidade / Finanças174
 
9.9%
Estatística/ Matemática / Matemática Computacional104
 
5.9%
Outras92
 
5.2%
Marketing / Publicidade / Comunicação / Jornalismo47
 
2.7%
Química / Física28
 
1.6%
Ciências Sociais25
 
1.4%
(Missing)35
 
2.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
2647
22.6%
de2026
17.3%
computação1013
 
8.6%
engenharia1013
 
8.6%
software1013
 
8.6%
sistemas1013
 
8.6%
informação1013
 
8.6%
outras339
 
2.9%
engenharias247
 
2.1%
matemática208
 
1.8%
Other values (14)1198
10.2%

Most occurring characters

ValueCountFrequency (%)
10000
 
12.1%
a9081
 
11.0%
o6182
 
7.5%
e5788
 
7.0%
n4673
 
5.6%
t4605
 
5.6%
i4124
 
5.0%
r3893
 
4.7%
m3821
 
4.6%
s3293
 
4.0%
Other values (28)27416
33.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter62894
75.9%
Space Separator10000
 
12.1%
Uppercase Letter7057
 
8.5%
Other Punctuation2925
 
3.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a9081
14.4%
o6182
9.8%
e5788
 
9.2%
n4673
 
7.4%
t4605
 
7.3%
i4124
 
6.6%
r3893
 
6.2%
m3821
 
6.1%
s3293
 
5.2%
d2642
 
4.2%
Other values (15)14792
23.5%
Uppercase Letter
ValueCountFrequency (%)
S2051
29.1%
E1538
21.8%
C1363
19.3%
I1013
14.4%
O339
 
4.8%
M255
 
3.6%
F202
 
2.9%
A174
 
2.5%
P47
 
0.7%
J47
 
0.7%
Space Separator
ValueCountFrequency (%)
10000
100.0%
Other Punctuation
ValueCountFrequency (%)
/2925
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin69951
84.4%
Common12925
 
15.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
a9081
13.0%
o6182
 
8.8%
e5788
 
8.3%
n4673
 
6.7%
t4605
 
6.6%
i4124
 
5.9%
r3893
 
5.6%
m3821
 
5.5%
s3293
 
4.7%
d2642
 
3.8%
Other values (26)21849
31.2%
Common
ValueCountFrequency (%)
10000
77.4%
/2925
 
22.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII77815
93.9%
None5061
 
6.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10000
12.9%
a9081
11.7%
o6182
 
7.9%
e5788
 
7.4%
n4673
 
6.0%
t4605
 
5.9%
i4124
 
5.3%
r3893
 
5.0%
m3821
 
4.9%
s3293
 
4.2%
Other values (23)22355
28.7%
None
ValueCountFrequency (%)
ç2421
47.8%
ã2247
44.4%
á208
 
4.1%
í160
 
3.2%
ê25
 
0.5%

('D4', 'anonymized_market_sector')
Categorical

HIGH CORRELATION
MISSING

Distinct17
Distinct (%)1.1%
Missing243
Missing (%)13.8%
Memory size13.9 KiB
Tecnologia/Fábrica de Software
501 
Outras
183 
Finanças ou Bancos
182 
Setor Público
89 
Educação
86 
Other values (12)
481 

Length

Max length30
Median length22
Mean length18.5978975
Min length6

Characters and Unicode

Total characters28306
Distinct characters44
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOutras
2nd rowEducação
3rd rowIndústria (Manufatura)
4th rowTecnologia/Fábrica de Software
5th rowInternet/Ecommerce

Common Values

ValueCountFrequency (%)
Tecnologia/Fábrica de Software501
28.4%
Outras183
 
10.4%
Finanças ou Bancos182
 
10.3%
Setor Público89
 
5.0%
Educação86
 
4.9%
Indústria (Manufatura)78
 
4.4%
Varejo73
 
4.1%
Marketing72
 
4.1%
Área da Saúde63
 
3.6%
Internet/Ecommerce60
 
3.4%
Other values (7)135
 
7.6%
(Missing)243
13.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
tecnologia/fábrica501
15.2%
de501
15.2%
software501
15.2%
ou216
 
6.6%
outras183
 
5.6%
finanças182
 
5.5%
bancos182
 
5.5%
setor132
 
4.0%
público89
 
2.7%
educação86
 
2.6%
Other values (17)719
21.8%

Most occurring characters

ValueCountFrequency (%)
a3036
 
10.7%
o2595
 
9.2%
e2371
 
8.4%
r1897
 
6.7%
1770
 
6.3%
c1622
 
5.7%
i1586
 
5.6%
n1538
 
5.4%
t1304
 
4.6%
d806
 
2.8%
Other values (34)9781
34.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter22746
80.4%
Uppercase Letter3073
 
10.9%
Space Separator1770
 
6.3%
Other Punctuation561
 
2.0%
Close Punctuation78
 
0.3%
Open Punctuation78
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a3036
13.3%
o2595
11.4%
e2371
10.4%
r1897
 
8.3%
c1622
 
7.1%
i1586
 
7.0%
n1538
 
6.8%
t1304
 
5.7%
d806
 
3.5%
u725
 
3.2%
Other values (18)5266
23.2%
Uppercase Letter
ValueCountFrequency (%)
S711
23.1%
F692
22.5%
T540
17.6%
E184
 
6.0%
O183
 
6.0%
B182
 
5.9%
M150
 
4.9%
I138
 
4.5%
P104
 
3.4%
V73
 
2.4%
Other values (2)116
 
3.8%
Space Separator
ValueCountFrequency (%)
1770
100.0%
Other Punctuation
ValueCountFrequency (%)
/561
100.0%
Close Punctuation
ValueCountFrequency (%)
)78
100.0%
Open Punctuation
ValueCountFrequency (%)
(78
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin25819
91.2%
Common2487
 
8.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
a3036
 
11.8%
o2595
 
10.1%
e2371
 
9.2%
r1897
 
7.3%
c1622
 
6.3%
i1586
 
6.1%
n1538
 
6.0%
t1304
 
5.1%
d806
 
3.1%
u725
 
2.8%
Other values (30)8339
32.3%
Common
ValueCountFrequency (%)
1770
71.2%
/561
 
22.6%
)78
 
3.1%
(78
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII27033
95.5%
None1273
 
4.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a3036
 
11.2%
o2595
 
9.6%
e2371
 
8.8%
r1897
 
7.0%
1770
 
6.5%
c1622
 
6.0%
i1586
 
5.9%
n1538
 
5.7%
t1304
 
4.8%
d806
 
3.0%
Other values (26)8508
31.5%
None
ValueCountFrequency (%)
á501
39.4%
ç307
24.1%
ú230
18.1%
ã125
 
9.8%
Á63
 
4.9%
ó19
 
1.5%
ê15
 
1.2%
í13
 
1.0%

('D5', 'anonymized_manager_level')
Categorical

HIGH CORRELATION
MISSING

Distinct8
Distinct (%)2.6%
Missing1460
Missing (%)82.7%
Memory size13.9 KiB
Coordenador
73 
Gerente
60 
Team Leader/Tech Leader
53 
C-level (CDO, CIO, CTO)
38 
Head
26 
Other values (3)
55 

Length

Max length23
Median length11
Mean length12.68196721
Min length4

Characters and Unicode

Total characters3868
Distinct characters31
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowC-level (CDO, CIO, CTO)
2nd rowTeam Leader/Tech Leader
3rd rowCoordenador
4th rowCoordenador
5th rowC-level (CDO, CIO, CTO)

Common Values

ValueCountFrequency (%)
Coordenador73
 
4.1%
Gerente60
 
3.4%
Team Leader/Tech Leader53
 
3.0%
C-level (CDO, CIO, CTO)38
 
2.2%
Head26
 
1.5%
Supervisor24
 
1.4%
Diretor22
 
1.2%
Outras9
 
0.5%
(Missing)1460
82.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
coordenador73
13.9%
gerente60
11.4%
team53
10.1%
leader/tech53
10.1%
leader53
10.1%
c-level38
7.2%
cdo38
7.2%
cio38
7.2%
cto38
7.2%
head26
 
5.0%
Other values (3)55
10.5%

Most occurring characters

ValueCountFrequency (%)
e719
18.6%
r413
 
10.7%
d278
 
7.2%
a267
 
6.9%
o265
 
6.9%
C225
 
5.8%
220
 
5.7%
T144
 
3.7%
n133
 
3.4%
O123
 
3.2%
Other values (21)1081
27.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2599
67.2%
Uppercase Letter806
 
20.8%
Space Separator220
 
5.7%
Other Punctuation129
 
3.3%
Dash Punctuation38
 
1.0%
Open Punctuation38
 
1.0%
Close Punctuation38
 
1.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e719
27.7%
r413
15.9%
d278
 
10.7%
a267
 
10.3%
o265
 
10.2%
n133
 
5.1%
t91
 
3.5%
l76
 
2.9%
v62
 
2.4%
h53
 
2.0%
Other values (6)242
 
9.3%
Uppercase Letter
ValueCountFrequency (%)
C225
27.9%
T144
17.9%
O123
15.3%
L106
13.2%
D60
 
7.4%
G60
 
7.4%
I38
 
4.7%
H26
 
3.2%
S24
 
3.0%
Other Punctuation
ValueCountFrequency (%)
,76
58.9%
/53
41.1%
Space Separator
ValueCountFrequency (%)
220
100.0%
Dash Punctuation
ValueCountFrequency (%)
-38
100.0%
Open Punctuation
ValueCountFrequency (%)
(38
100.0%
Close Punctuation
ValueCountFrequency (%)
)38
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin3405
88.0%
Common463
 
12.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e719
21.1%
r413
12.1%
d278
 
8.2%
a267
 
7.8%
o265
 
7.8%
C225
 
6.6%
T144
 
4.2%
n133
 
3.9%
O123
 
3.6%
L106
 
3.1%
Other values (15)732
21.5%
Common
ValueCountFrequency (%)
220
47.5%
,76
 
16.4%
/53
 
11.4%
-38
 
8.2%
(38
 
8.2%
)38
 
8.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII3868
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e719
18.6%
r413
 
10.7%
d278
 
7.2%
a267
 
6.9%
o265
 
6.9%
C225
 
5.8%
220
 
5.7%
T144
 
3.7%
n133
 
3.4%
O123
 
3.2%
Other values (21)1081
27.9%

('D6', 'anonymized_role')
Categorical

HIGH CORRELATION
MISSING

Distinct14
Distinct (%)1.1%
Missing514
Missing (%)29.1%
Memory size13.9 KiB
Desenvolvedor ou Engenheiro de Software
225 
Outras
220 
Data Scientist/Cientista de Dados
167 
Data Analyst/Analista de Dados
163 
Business Intelligence/Analista de BI
150 
Other values (9)
326 

Length

Max length39
Median length36
Mean length28.52517986
Min length6

Characters and Unicode

Total characters35685
Distinct characters36
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOutras
2nd rowData Analyst/Analista de Dados
3rd rowOutras
4th rowBusiness Intelligence/Analista de BI
5th rowOutras

Common Values

ValueCountFrequency (%)
Desenvolvedor ou Engenheiro de Software225
12.7%
Outras220
12.5%
Data Scientist/Cientista de Dados167
 
9.5%
Data Analyst/Analista de Dados163
 
9.2%
Business Intelligence/Analista de BI150
 
8.5%
Data Engineer/Engenheiro de Dados130
 
7.4%
Business Analyst/Analista de Negócios72
 
4.1%
Analista de Inteligência de Mercado29
 
1.6%
Engenheiro26
 
1.5%
Analista de Marketing19
 
1.1%
Other values (4)50
 
2.8%
(Missing)514
29.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
de1027
23.1%
dados474
10.6%
data460
10.3%
engenheiro266
 
6.0%
analyst/analista235
 
5.3%
desenvolvedor225
 
5.1%
ou225
 
5.1%
software225
 
5.1%
business222
 
5.0%
outras220
 
4.9%
Other values (15)873
19.6%

Most occurring characters

ValueCountFrequency (%)
e4164
11.7%
a3263
 
9.1%
3201
 
9.0%
n2961
 
8.3%
s2705
 
7.6%
t2496
 
7.0%
i2227
 
6.2%
o1930
 
5.4%
d1783
 
5.0%
r1287
 
3.6%
Other values (26)9668
27.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter27714
77.7%
Uppercase Letter4074
 
11.4%
Space Separator3201
 
9.0%
Other Punctuation696
 
2.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e4164
15.0%
a3263
11.8%
n2961
10.7%
s2705
9.8%
t2496
9.0%
i2227
8.0%
o1930
7.0%
d1783
6.4%
r1287
 
4.6%
l1222
 
4.4%
Other values (13)3676
13.3%
Uppercase Letter
ValueCountFrequency (%)
D1173
28.8%
A696
17.1%
E547
13.4%
B400
 
9.8%
S392
 
9.6%
I329
 
8.1%
O220
 
5.4%
C167
 
4.1%
N72
 
1.8%
M63
 
1.5%
Space Separator
ValueCountFrequency (%)
3201
100.0%
Other Punctuation
ValueCountFrequency (%)
/696
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin31788
89.1%
Common3897
 
10.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e4164
13.1%
a3263
10.3%
n2961
 
9.3%
s2705
 
8.5%
t2496
 
7.9%
i2227
 
7.0%
o1930
 
6.1%
d1783
 
5.6%
r1287
 
4.0%
l1222
 
3.8%
Other values (24)7750
24.4%
Common
ValueCountFrequency (%)
3201
82.1%
/696
 
17.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII35573
99.7%
None112
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e4164
11.7%
a3263
 
9.2%
3201
 
9.0%
n2961
 
8.3%
s2705
 
7.6%
t2496
 
7.0%
i2227
 
6.3%
o1930
 
5.4%
d1783
 
5.0%
r1287
 
3.6%
Other values (23)9556
26.9%
None
ValueCountFrequency (%)
ó72
64.3%
ê29
25.9%
í11
 
9.8%

profissao
Categorical

HIGH CORRELATION
MISSING

Distinct4
Distinct (%)0.4%
Missing821
Missing (%)46.5%
Memory size13.9 KiB
Outras
265 
Analista de BI
251 
Desenvolvedor ou Engenheiro de Software
225 
Cientista de Dados
203 

Length

Max length39
Median length18
Mean length18.57309322
Min length6

Characters and Unicode

Total characters17533
Distinct characters25
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOutras
2nd rowOutras
3rd rowAnalista de BI
4th rowOutras
5th rowDesenvolvedor ou Engenheiro de Software

Common Values

ValueCountFrequency (%)
Outras265
 
15.0%
Analista de BI251
 
14.2%
Desenvolvedor ou Engenheiro de Software225
 
12.7%
Cientista de Dados203
 
11.5%
(Missing)821
46.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
de679
24.7%
outras265
 
9.6%
analista251
 
9.1%
bi251
 
9.1%
desenvolvedor225
 
8.2%
ou225
 
8.2%
engenheiro225
 
8.2%
software225
 
8.2%
cientista203
 
7.4%
dados203
 
7.4%

Most occurring characters

ValueCountFrequency (%)
e2232
12.7%
1808
10.3%
a1398
 
8.0%
o1328
 
7.6%
t1147
 
6.5%
s1147
 
6.5%
n1129
 
6.4%
d1107
 
6.3%
r940
 
5.4%
i882
 
5.0%
Other values (15)4415
25.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter13626
77.7%
Uppercase Letter2099
 
12.0%
Space Separator1808
 
10.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e2232
16.4%
a1398
10.3%
o1328
9.7%
t1147
8.4%
s1147
8.4%
n1129
8.3%
d1107
8.1%
r940
6.9%
i882
 
6.5%
u490
 
3.6%
Other values (6)1826
13.4%
Uppercase Letter
ValueCountFrequency (%)
D428
20.4%
O265
12.6%
I251
12.0%
B251
12.0%
A251
12.0%
E225
10.7%
S225
10.7%
C203
9.7%
Space Separator
ValueCountFrequency (%)
1808
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin15725
89.7%
Common1808
 
10.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e2232
14.2%
a1398
 
8.9%
o1328
 
8.4%
t1147
 
7.3%
s1147
 
7.3%
n1129
 
7.2%
d1107
 
7.0%
r940
 
6.0%
i882
 
5.6%
u490
 
3.1%
Other values (14)3925
25.0%
Common
ValueCountFrequency (%)
1808
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII17533
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e2232
12.7%
1808
10.3%
a1398
 
8.0%
o1328
 
7.6%
t1147
 
6.5%
s1147
 
6.5%
n1129
 
6.4%
d1107
 
6.3%
r940
 
5.4%
i882
 
5.0%
Other values (15)4415
25.2%

idade
Categorical

HIGH CORRELATION
MISSING

Distinct4
Distinct (%)0.2%
Missing24
Missing (%)1.4%
Memory size2.0 KiB
[25,30]
650 
[31,40]
559 
[18,24]
399 
[41,50]
133 

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters12187
Distinct characters10
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row[31,40]
2nd row[18,24]
3rd row[25,30]
4th row[18,24]
5th row[25,30]

Common Values

ValueCountFrequency (%)
[25,30]650
36.8%
[31,40]559
31.7%
[18,24]399
22.6%
[41,50]133
 
7.5%
(Missing)24
 
1.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
25,30650
37.3%
31,40559
32.1%
18,24399
22.9%
41,50133
 
7.6%

Most occurring characters

ValueCountFrequency (%)
[1741
14.3%
,1741
14.3%
]1741
14.3%
01342
11.0%
31209
9.9%
11091
9.0%
41091
9.0%
21049
8.6%
5783
6.4%
8399
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number6964
57.1%
Open Punctuation1741
 
14.3%
Other Punctuation1741
 
14.3%
Close Punctuation1741
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01342
19.3%
31209
17.4%
11091
15.7%
41091
15.7%
21049
15.1%
5783
11.2%
8399
 
5.7%
Open Punctuation
ValueCountFrequency (%)
[1741
100.0%
Other Punctuation
ValueCountFrequency (%)
,1741
100.0%
Close Punctuation
ValueCountFrequency (%)
]1741
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common12187
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
[1741
14.3%
,1741
14.3%
]1741
14.3%
01342
11.0%
31209
9.9%
11091
9.0%
41091
9.0%
21049
8.6%
5783
6.4%
8399
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII12187
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
[1741
14.3%
,1741
14.3%
]1741
14.3%
01342
11.0%
31209
9.9%
11091
9.0%
41091
9.0%
21049
8.6%
5783
6.4%
8399
 
3.3%

salario
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
MISSING

Distinct11
Distinct (%)0.7%
Missing238
Missing (%)13.5%
Infinite0
Infinite (%)0.0%
Mean6283.235102
Minimum1000
Maximum25000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.9 KiB

Quantile statistics

Minimum1000
5-th percentile1500
Q13500
median5000
Q310000
95-th percentile14000
Maximum25000
Range24000
Interquartile range (IQR)6500

Descriptive statistics

Standard deviation4634.954013
Coefficient of variation (CV)0.7376699961
Kurtosis2.687305868
Mean6283.235102
Median Absolute Deviation (MAD)2500
Skewness1.52321921
Sum9594500
Variance21482798.7
MonotonicityNot monotonic
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
5000308
17.5%
10000237
13.4%
7000229
13.0%
3500219
12.4%
1500181
10.3%
2500150
8.5%
1400082
 
4.6%
100048
 
2.7%
1800045
 
2.5%
2250015
 
0.8%
(Missing)238
13.5%
ValueCountFrequency (%)
100048
 
2.7%
1500181
10.3%
2500150
8.5%
3500219
12.4%
5000308
17.5%
7000229
13.0%
10000237
13.4%
1400082
 
4.6%
1800045
 
2.5%
2250015
 
0.8%
ValueCountFrequency (%)
2500013
 
0.7%
2250015
 
0.8%
1800045
 
2.5%
1400082
 
4.6%
10000237
13.4%
7000229
13.0%
5000308
17.5%
3500219
12.4%
2500150
8.5%
1500181
10.3%

tamanho_da_empresa
Categorical

HIGH CORRELATION
MISSING

Distinct3
Distinct (%)0.2%
Missing366
Missing (%)20.7%
Memory size13.9 KiB
Grande
557 
Média
505 
Pequena
337 

Length

Max length7
Median length6
Mean length5.879914224
Min length5

Characters and Unicode

Total characters8226
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPequena
2nd rowGrande
3rd rowGrande
4th rowPequena
5th rowPequena

Common Values

ValueCountFrequency (%)
Grande557
31.6%
Média505
28.6%
Pequena337
19.1%
(Missing)366
20.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
grande557
39.8%
média505
36.1%
pequena337
24.1%

Most occurring characters

ValueCountFrequency (%)
a1399
17.0%
e1231
15.0%
d1062
12.9%
n894
10.9%
G557
 
6.8%
r557
 
6.8%
M505
 
6.1%
é505
 
6.1%
i505
 
6.1%
P337
 
4.1%
Other values (2)674
8.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter6827
83.0%
Uppercase Letter1399
 
17.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a1399
20.5%
e1231
18.0%
d1062
15.6%
n894
13.1%
r557
 
8.2%
é505
 
7.4%
i505
 
7.4%
q337
 
4.9%
u337
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
G557
39.8%
M505
36.1%
P337
24.1%

Most occurring scripts

ValueCountFrequency (%)
Latin8226
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a1399
17.0%
e1231
15.0%
d1062
12.9%
n894
10.9%
G557
 
6.8%
r557
 
6.8%
M505
 
6.1%
é505
 
6.1%
i505
 
6.1%
P337
 
4.1%
Other values (2)674
8.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII7721
93.9%
None505
 
6.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a1399
18.1%
e1231
15.9%
d1062
13.8%
n894
11.6%
G557
 
7.2%
r557
 
7.2%
M505
 
6.5%
i505
 
6.5%
P337
 
4.4%
q337
 
4.4%
None
ValueCountFrequency (%)
é505
100.0%

gestor
Categorical

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)0.1%
Missing238
Missing (%)13.5%
Memory size13.9 KiB
não
1222 
sim
305 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters4581
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rownão
2nd rownão
3rd rownão
4th rownão
5th rowsim

Common Values

ValueCountFrequency (%)
não1222
69.2%
sim305
 
17.3%
(Missing)238
 
13.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
não1222
80.0%
sim305
 
20.0%

Most occurring characters

ValueCountFrequency (%)
n1222
26.7%
ã1222
26.7%
o1222
26.7%
s305
 
6.7%
i305
 
6.7%
m305
 
6.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter4581
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n1222
26.7%
ã1222
26.7%
o1222
26.7%
s305
 
6.7%
i305
 
6.7%
m305
 
6.7%

Most occurring scripts

ValueCountFrequency (%)
Latin4581
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
n1222
26.7%
ã1222
26.7%
o1222
26.7%
s305
 
6.7%
i305
 
6.7%
m305
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII3359
73.3%
None1222
 
26.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n1222
36.4%
o1222
36.4%
s305
 
9.1%
i305
 
9.1%
m305
 
9.1%
None
ValueCountFrequency (%)
ã1222
100.0%

se_considera_ds
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
sim
915 
não
850 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters5295
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rownão
2nd rowsim
3rd rowsim
4th rowsim
5th rowsim

Common Values

ValueCountFrequency (%)
sim915
51.8%
não850
48.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
sim915
51.8%
não850
48.2%

Most occurring characters

ValueCountFrequency (%)
s915
17.3%
i915
17.3%
m915
17.3%
n850
16.1%
ã850
16.1%
o850
16.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter5295
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s915
17.3%
i915
17.3%
m915
17.3%
n850
16.1%
ã850
16.1%
o850
16.1%

Most occurring scripts

ValueCountFrequency (%)
Latin5295
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
s915
17.3%
i915
17.3%
m915
17.3%
n850
16.1%
ã850
16.1%
o850
16.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII4445
83.9%
None850
 
16.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s915
20.6%
i915
20.6%
m915
20.6%
n850
19.1%
o850
19.1%
None
ValueCountFrequency (%)
ã850
100.0%

sexo
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing3
Missing (%)0.2%
Memory size13.9 KiB
Masculino
1436 
Feminino
326 

Length

Max length9
Median length9
Mean length8.814982974
Min length8

Characters and Unicode

Total characters15532
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMasculino
2nd rowFeminino
3rd rowMasculino
4th rowMasculino
5th rowMasculino

Common Values

ValueCountFrequency (%)
Masculino1436
81.4%
Feminino326
 
18.5%
(Missing)3
 
0.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
masculino1436
81.5%
feminino326
 
18.5%

Most occurring characters

ValueCountFrequency (%)
i2088
13.4%
n2088
13.4%
o1762
11.3%
M1436
9.2%
a1436
9.2%
s1436
9.2%
c1436
9.2%
u1436
9.2%
l1436
9.2%
F326
 
2.1%
Other values (2)652
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter13770
88.7%
Uppercase Letter1762
 
11.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i2088
15.2%
n2088
15.2%
o1762
12.8%
a1436
10.4%
s1436
10.4%
c1436
10.4%
u1436
10.4%
l1436
10.4%
e326
 
2.4%
m326
 
2.4%
Uppercase Letter
ValueCountFrequency (%)
M1436
81.5%
F326
 
18.5%

Most occurring scripts

ValueCountFrequency (%)
Latin15532
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
i2088
13.4%
n2088
13.4%
o1762
11.3%
M1436
9.2%
a1436
9.2%
s1436
9.2%
c1436
9.2%
u1436
9.2%
l1436
9.2%
F326
 
2.1%
Other values (2)652
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII15532
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i2088
13.4%
n2088
13.4%
o1762
11.3%
M1436
9.2%
a1436
9.2%
s1436
9.2%
c1436
9.2%
u1436
9.2%
l1436
9.2%
F326
 
2.1%
Other values (2)652
 
4.2%

experiencia_ds
Categorical

HIGH CORRELATION

Distinct7
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
Menos de 1 ano
445 
de 1 a 2 anos
343 
de 2 a 3 anos
244 
Não tenho experiência na área de dados
221 
de 4 a 5 anos
186 
Other values (2)
326 

Length

Max length38
Median length15
Mean length16.65042493
Min length13

Characters and Unicode

Total characters29388
Distinct characters26
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNão tenho experiência na área de dados
2nd rowMenos de 1 ano
3rd rowde 1 a 2 anos
4th rowMenos de 1 ano
5th rowde 4 a 5 anos

Common Values

ValueCountFrequency (%)
Menos de 1 ano445
25.2%
de 1 a 2 anos343
19.4%
de 2 a 3 anos244
13.8%
Não tenho experiência na área de dados221
12.5%
de 4 a 5 anos186
10.5%
de 6 a 10 anos179
10.1%
Mais de 10 anos147
 
8.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
de1765
20.3%
anos1099
12.7%
a952
11.0%
1788
9.1%
2587
 
6.8%
menos445
 
5.1%
ano445
 
5.1%
10326
 
3.8%
3244
 
2.8%
área221
 
2.5%
Other values (9)1803
20.8%

Most occurring characters

ValueCountFrequency (%)
6910
23.5%
a3527
12.0%
e3094
10.5%
n2652
 
9.0%
o2652
 
9.0%
d2207
 
7.5%
s1912
 
6.5%
11114
 
3.8%
M592
 
2.0%
i589
 
2.0%
Other values (16)4139
14.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter18843
64.1%
Space Separator6910
 
23.5%
Decimal Number2822
 
9.6%
Uppercase Letter813
 
2.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a3527
18.7%
e3094
16.4%
n2652
14.1%
o2652
14.1%
d2207
11.7%
s1912
10.1%
i589
 
3.1%
r442
 
2.3%
ã221
 
1.2%
h221
 
1.2%
Other values (6)1326
 
7.0%
Decimal Number
ValueCountFrequency (%)
11114
39.5%
2587
20.8%
0326
 
11.6%
3244
 
8.6%
4186
 
6.6%
5186
 
6.6%
6179
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
M592
72.8%
N221
 
27.2%
Space Separator
ValueCountFrequency (%)
6910
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin19656
66.9%
Common9732
33.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a3527
17.9%
e3094
15.7%
n2652
13.5%
o2652
13.5%
d2207
11.2%
s1912
9.7%
M592
 
3.0%
i589
 
3.0%
r442
 
2.2%
ã221
 
1.1%
Other values (8)1768
9.0%
Common
ValueCountFrequency (%)
6910
71.0%
11114
 
11.4%
2587
 
6.0%
0326
 
3.3%
3244
 
2.5%
4186
 
1.9%
5186
 
1.9%
6179
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII28725
97.7%
None663
 
2.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6910
24.1%
a3527
12.3%
e3094
10.8%
n2652
 
9.2%
o2652
 
9.2%
d2207
 
7.7%
s1912
 
6.7%
11114
 
3.9%
M592
 
2.1%
i589
 
2.1%
Other values (13)3476
12.1%
None
ValueCountFrequency (%)
ã221
33.3%
ê221
33.3%
á221
33.3%

tipo_de_trabalho
Categorical

HIGH CORRELATION

Distinct11
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
Empregado (CTL)
1073 
Empreendedor ou Empregado (CNPJ)
234 
Estagiário
131 
Somente Estudante (graduação)
 
85
Desempregado, buscando recolocação
 
69
Other values (6)
173 

Length

Max length45
Median length15
Mean length19.27988669
Min length10

Characters and Unicode

Total characters34029
Distinct characters44
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEmpregado (CTL)
2nd rowEmpregado (CTL)
3rd rowEmpregado (CTL)
4th rowEstagiário
5th rowFreelancer

Common Values

ValueCountFrequency (%)
Empregado (CTL)1073
60.8%
Empreendedor ou Empregado (CNPJ)234
 
13.3%
Estagiário131
 
7.4%
Somente Estudante (graduação)85
 
4.8%
Desempregado, buscando recolocação69
 
3.9%
Servidor Público60
 
3.4%
Trabalho na área Acadêmica/Pesquisador45
 
2.5%
Somente Estudante (pós-graduação)36
 
2.0%
Freelancer23
 
1.3%
Prefiro não dizer6
 
0.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
empregado1307
31.6%
ctl1073
25.9%
empreendedor234
 
5.6%
ou234
 
5.6%
cnpj234
 
5.6%
estagiário131
 
3.2%
somente121
 
2.9%
estudante121
 
2.9%
graduação85
 
2.1%
desempregado72
 
1.7%
Other values (15)530
12.8%

Most occurring characters

ValueCountFrequency (%)
e2897
 
8.5%
o2736
 
8.0%
r2490
 
7.3%
2377
 
7.0%
a2355
 
6.9%
d2317
 
6.8%
E1793
 
5.3%
m1779
 
5.2%
p1649
 
4.8%
g1631
 
4.8%
Other values (34)12005
35.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter22221
65.3%
Uppercase Letter6425
 
18.9%
Space Separator2377
 
7.0%
Open Punctuation1428
 
4.2%
Close Punctuation1428
 
4.2%
Other Punctuation114
 
0.3%
Dash Punctuation36
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e2897
13.0%
o2736
12.3%
r2490
11.2%
a2355
10.6%
d2317
10.4%
m1779
8.0%
p1649
7.4%
g1631
7.3%
n625
 
2.8%
u596
 
2.7%
Other values (17)3146
14.2%
Uppercase Letter
ValueCountFrequency (%)
E1793
27.9%
C1307
20.3%
T1118
17.4%
L1073
16.7%
P345
 
5.4%
J234
 
3.6%
N234
 
3.6%
S181
 
2.8%
D72
 
1.1%
A45
 
0.7%
Other Punctuation
ValueCountFrequency (%)
,69
60.5%
/45
39.5%
Space Separator
ValueCountFrequency (%)
2377
100.0%
Open Punctuation
ValueCountFrequency (%)
(1428
100.0%
Close Punctuation
ValueCountFrequency (%)
)1428
100.0%
Dash Punctuation
ValueCountFrequency (%)
-36
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin28646
84.2%
Common5383
 
15.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e2897
 
10.1%
o2736
 
9.6%
r2490
 
8.7%
a2355
 
8.2%
d2317
 
8.1%
E1793
 
6.3%
m1779
 
6.2%
p1649
 
5.8%
g1631
 
5.7%
C1307
 
4.6%
Other values (28)7692
26.9%
Common
ValueCountFrequency (%)
2377
44.2%
(1428
26.5%
)1428
26.5%
,69
 
1.3%
/45
 
0.8%
-36
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII33317
97.9%
None712
 
2.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e2897
 
8.7%
o2736
 
8.2%
r2490
 
7.5%
2377
 
7.1%
a2355
 
7.1%
d2317
 
7.0%
E1793
 
5.4%
m1779
 
5.3%
p1649
 
4.9%
g1631
 
4.9%
Other values (28)11293
33.9%
None
ValueCountFrequency (%)
ã202
28.4%
ç193
27.1%
á176
24.7%
ú60
 
8.4%
ê45
 
6.3%
ó36
 
5.1%

escolaridade
Categorical

HIGH CORRELATION

Distinct7
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
Graduação/Bacharelado
578 
Pós-graduação
527 
Estudante de Graduação
374 
Mestrado
201 
Doutorado ou Phd
 
50
Other values (2)
 
35

Length

Max length26
Median length22
Mean length17.29688385
Min length8

Characters and Unicode

Total characters30529
Distinct characters29
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st rowEstudante de Graduação
2nd rowEstudante de Graduação
3rd rowGraduação/Bacharelado
4th rowEstudante de Graduação
5th rowGraduação/Bacharelado

Common Values

ValueCountFrequency (%)
Graduação/Bacharelado578
32.7%
Pós-graduação527
29.9%
Estudante de Graduação374
21.2%
Mestrado201
 
11.4%
Doutorado ou Phd50
 
2.8%
Não tenho graduação formal34
 
1.9%
Prefiro não informar1
 
0.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
graduação/bacharelado578
21.3%
pós-graduação527
19.4%
graduação408
15.0%
estudante374
13.8%
de374
13.8%
mestrado201
 
7.4%
doutorado50
 
1.8%
ou50
 
1.8%
phd50
 
1.8%
não35
 
1.3%
Other values (4)70
 
2.6%

Most occurring characters

ValueCountFrequency (%)
a5420
17.8%
d3140
 
10.3%
o2597
 
8.5%
r2380
 
7.8%
u1987
 
6.5%
e1562
 
5.1%
ã1548
 
5.1%
ç1513
 
5.0%
s1102
 
3.6%
t1033
 
3.4%
Other values (19)8247
27.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter25705
84.2%
Uppercase Letter2767
 
9.1%
Space Separator952
 
3.1%
Other Punctuation578
 
1.9%
Dash Punctuation527
 
1.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a5420
21.1%
d3140
12.2%
o2597
10.1%
r2380
9.3%
u1987
 
7.7%
e1562
 
6.1%
ã1548
 
6.0%
ç1513
 
5.9%
s1102
 
4.3%
t1033
 
4.0%
Other values (9)3423
13.3%
Uppercase Letter
ValueCountFrequency (%)
G952
34.4%
P578
20.9%
B578
20.9%
E374
 
13.5%
M201
 
7.3%
D50
 
1.8%
N34
 
1.2%
Space Separator
ValueCountFrequency (%)
952
100.0%
Other Punctuation
ValueCountFrequency (%)
/578
100.0%
Dash Punctuation
ValueCountFrequency (%)
-527
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin28472
93.3%
Common2057
 
6.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
a5420
19.0%
d3140
11.0%
o2597
9.1%
r2380
 
8.4%
u1987
 
7.0%
e1562
 
5.5%
ã1548
 
5.4%
ç1513
 
5.3%
s1102
 
3.9%
t1033
 
3.6%
Other values (16)6190
21.7%
Common
ValueCountFrequency (%)
952
46.3%
/578
28.1%
-527
25.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII26941
88.2%
None3588
 
11.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a5420
20.1%
d3140
11.7%
o2597
9.6%
r2380
8.8%
u1987
 
7.4%
e1562
 
5.8%
s1102
 
4.1%
t1033
 
3.8%
G952
 
3.5%
952
 
3.5%
Other values (16)5816
21.6%
None
ValueCountFrequency (%)
ã1548
43.1%
ç1513
42.2%
ó527
 
14.7%

area_de_formacao
Categorical

HIGH CORRELATION
MISSING

Distinct8
Distinct (%)0.5%
Missing35
Missing (%)2.0%
Memory size13.9 KiB
Computação / Engenharia de Software / Sistemas de Informação
1013 
Outras Engenharias
247 
Economia/ Administração / Contabilidade / Finanças
174 
Estatística/ Matemática / Matemática Computacional
104 
Outras
 
92
Other values (3)
 
100

Length

Max length60
Median length60
Mean length47.90520231
Min length6

Characters and Unicode

Total characters82876
Distinct characters38
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowComputação / Engenharia de Software / Sistemas de Informação
2nd rowComputação / Engenharia de Software / Sistemas de Informação
3rd rowOutras Engenharias
4th rowComputação / Engenharia de Software / Sistemas de Informação
5th rowComputação / Engenharia de Software / Sistemas de Informação

Common Values

ValueCountFrequency (%)
Computação / Engenharia de Software / Sistemas de Informação1013
57.4%
Outras Engenharias247
 
14.0%
Economia/ Administração / Contabilidade / Finanças174
 
9.9%
Estatística/ Matemática / Matemática Computacional104
 
5.9%
Outras92
 
5.2%
Marketing / Publicidade / Comunicação / Jornalismo47
 
2.7%
Química / Física28
 
1.6%
Ciências Sociais25
 
1.4%
(Missing)35
 
2.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
2647
22.6%
de2026
17.3%
computação1013
 
8.6%
engenharia1013
 
8.6%
software1013
 
8.6%
sistemas1013
 
8.6%
informação1013
 
8.6%
outras339
 
2.9%
engenharias247
 
2.1%
matemática208
 
1.8%
Other values (14)1198
10.2%

Most occurring characters

ValueCountFrequency (%)
10000
 
12.1%
a9081
 
11.0%
o6182
 
7.5%
e5788
 
7.0%
n4673
 
5.6%
t4605
 
5.6%
i4124
 
5.0%
r3893
 
4.7%
m3821
 
4.6%
s3293
 
4.0%
Other values (28)27416
33.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter62894
75.9%
Space Separator10000
 
12.1%
Uppercase Letter7057
 
8.5%
Other Punctuation2925
 
3.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a9081
14.4%
o6182
9.8%
e5788
 
9.2%
n4673
 
7.4%
t4605
 
7.3%
i4124
 
6.6%
r3893
 
6.2%
m3821
 
6.1%
s3293
 
5.2%
d2642
 
4.2%
Other values (15)14792
23.5%
Uppercase Letter
ValueCountFrequency (%)
S2051
29.1%
E1538
21.8%
C1363
19.3%
I1013
14.4%
O339
 
4.8%
M255
 
3.6%
F202
 
2.9%
A174
 
2.5%
P47
 
0.7%
J47
 
0.7%
Space Separator
ValueCountFrequency (%)
10000
100.0%
Other Punctuation
ValueCountFrequency (%)
/2925
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin69951
84.4%
Common12925
 
15.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
a9081
13.0%
o6182
 
8.8%
e5788
 
8.3%
n4673
 
6.7%
t4605
 
6.6%
i4124
 
5.9%
r3893
 
5.6%
m3821
 
5.5%
s3293
 
4.7%
d2642
 
3.8%
Other values (26)21849
31.2%
Common
ValueCountFrequency (%)
10000
77.4%
/2925
 
22.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII77815
93.9%
None5061
 
6.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10000
12.9%
a9081
11.7%
o6182
 
7.9%
e5788
 
7.4%
n4673
 
6.0%
t4605
 
5.9%
i4124
 
5.3%
r3893
 
5.0%
m3821
 
4.9%
s3293
 
4.2%
Other values (23)22355
28.7%
None
ValueCountFrequency (%)
ç2421
47.8%
ã2247
44.4%
á208
 
4.1%
í160
 
3.2%
ê25
 
0.5%

setor_de_mercado
Categorical

HIGH CORRELATION
MISSING

Distinct17
Distinct (%)1.1%
Missing243
Missing (%)13.8%
Memory size13.9 KiB
Tecnologia/Fábrica de Software
501 
Outras
183 
Finanças ou Bancos
182 
Setor Público
89 
Educação
86 
Other values (12)
481 

Length

Max length30
Median length22
Mean length18.5978975
Min length6

Characters and Unicode

Total characters28306
Distinct characters44
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOutras
2nd rowEducação
3rd rowIndústria (Manufatura)
4th rowTecnologia/Fábrica de Software
5th rowInternet/Ecommerce

Common Values

ValueCountFrequency (%)
Tecnologia/Fábrica de Software501
28.4%
Outras183
 
10.4%
Finanças ou Bancos182
 
10.3%
Setor Público89
 
5.0%
Educação86
 
4.9%
Indústria (Manufatura)78
 
4.4%
Varejo73
 
4.1%
Marketing72
 
4.1%
Área da Saúde63
 
3.6%
Internet/Ecommerce60
 
3.4%
Other values (7)135
 
7.6%
(Missing)243
13.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
tecnologia/fábrica501
15.2%
de501
15.2%
software501
15.2%
ou216
 
6.6%
outras183
 
5.6%
finanças182
 
5.5%
bancos182
 
5.5%
setor132
 
4.0%
público89
 
2.7%
educação86
 
2.6%
Other values (17)719
21.8%

Most occurring characters

ValueCountFrequency (%)
a3036
 
10.7%
o2595
 
9.2%
e2371
 
8.4%
r1897
 
6.7%
1770
 
6.3%
c1622
 
5.7%
i1586
 
5.6%
n1538
 
5.4%
t1304
 
4.6%
d806
 
2.8%
Other values (34)9781
34.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter22746
80.4%
Uppercase Letter3073
 
10.9%
Space Separator1770
 
6.3%
Other Punctuation561
 
2.0%
Close Punctuation78
 
0.3%
Open Punctuation78
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a3036
13.3%
o2595
11.4%
e2371
10.4%
r1897
 
8.3%
c1622
 
7.1%
i1586
 
7.0%
n1538
 
6.8%
t1304
 
5.7%
d806
 
3.5%
u725
 
3.2%
Other values (18)5266
23.2%
Uppercase Letter
ValueCountFrequency (%)
S711
23.1%
F692
22.5%
T540
17.6%
E184
 
6.0%
O183
 
6.0%
B182
 
5.9%
M150
 
4.9%
I138
 
4.5%
P104
 
3.4%
V73
 
2.4%
Other values (2)116
 
3.8%
Space Separator
ValueCountFrequency (%)
1770
100.0%
Other Punctuation
ValueCountFrequency (%)
/561
100.0%
Close Punctuation
ValueCountFrequency (%)
)78
100.0%
Open Punctuation
ValueCountFrequency (%)
(78
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin25819
91.2%
Common2487
 
8.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
a3036
 
11.8%
o2595
 
10.1%
e2371
 
9.2%
r1897
 
7.3%
c1622
 
6.3%
i1586
 
6.1%
n1538
 
6.0%
t1304
 
5.1%
d806
 
3.1%
u725
 
2.8%
Other values (30)8339
32.3%
Common
ValueCountFrequency (%)
1770
71.2%
/561
 
22.6%
)78
 
3.1%
(78
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII27033
95.5%
None1273
 
4.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a3036
 
11.2%
o2595
 
9.6%
e2371
 
8.8%
r1897
 
7.0%
1770
 
6.5%
c1622
 
6.0%
i1586
 
5.9%
n1538
 
5.7%
t1304
 
4.8%
d806
 
3.0%
Other values (26)8508
31.5%
None
ValueCountFrequency (%)
á501
39.4%
ç307
24.1%
ú230
18.1%
ã125
 
9.8%
Á63
 
4.9%
ó19
 
1.5%
ê15
 
1.2%
í13
 
1.0%

plataforma_favorita
Categorical

HIGH CORRELATION
MISSING

Distinct9
Distinct (%)0.6%
Missing140
Missing (%)7.9%
Memory size13.9 KiB
Udemy
466 
Coursera
309 
Udacity
238 
Nunca fiz cursos online
162 
DataCamp
162 
Other values (4)
288 

Length

Max length23
Median length12
Mean length8.270769231
Min length3

Characters and Unicode

Total characters13440
Distinct characters28
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNunca fiz cursos online
2nd rowUdemy
3rd rowAlura
4th rowUdemy
5th rowUdemy

Common Values

ValueCountFrequency (%)
Udemy466
26.4%
Coursera309
17.5%
Udacity238
13.5%
Nunca fiz cursos online162
 
9.2%
DataCamp162
 
9.2%
Alura138
 
7.8%
Kaggle Learn74
 
4.2%
edX52
 
2.9%
DataQuest24
 
1.4%
(Missing)140
 
7.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
udemy466
21.3%
coursera309
14.1%
udacity238
10.9%
nunca162
 
7.4%
fiz162
 
7.4%
cursos162
 
7.4%
online162
 
7.4%
datacamp162
 
7.4%
alura138
 
6.3%
kaggle74
 
3.4%
Other values (3)150
 
6.9%

Most occurring characters

ValueCountFrequency (%)
a1529
 
11.4%
e1161
 
8.6%
r992
 
7.4%
u795
 
5.9%
d756
 
5.6%
U704
 
5.2%
y704
 
5.2%
s657
 
4.9%
o633
 
4.7%
m628
 
4.7%
Other values (18)4881
36.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter10995
81.8%
Uppercase Letter1885
 
14.0%
Space Separator560
 
4.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a1529
13.9%
e1161
10.6%
r992
 
9.0%
u795
 
7.2%
d756
 
6.9%
y704
 
6.4%
s657
 
6.0%
o633
 
5.8%
m628
 
5.7%
c562
 
5.1%
Other values (8)2578
23.4%
Uppercase Letter
ValueCountFrequency (%)
U704
37.3%
C471
25.0%
D186
 
9.9%
N162
 
8.6%
A138
 
7.3%
K74
 
3.9%
L74
 
3.9%
X52
 
2.8%
Q24
 
1.3%
Space Separator
ValueCountFrequency (%)
560
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin12880
95.8%
Common560
 
4.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
a1529
 
11.9%
e1161
 
9.0%
r992
 
7.7%
u795
 
6.2%
d756
 
5.9%
U704
 
5.5%
y704
 
5.5%
s657
 
5.1%
o633
 
4.9%
m628
 
4.9%
Other values (17)4321
33.5%
Common
ValueCountFrequency (%)
560
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII13440
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a1529
 
11.4%
e1161
 
8.6%
r992
 
7.4%
u795
 
5.9%
d756
 
5.6%
U704
 
5.2%
y704
 
5.2%
s657
 
4.9%
o633
 
4.7%
m628
 
4.7%
Other values (18)4881
36.3%

Interactions

Correlations

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.

Sample

First rows

('P0', 'id')('P1', 'age')('P2', 'gender')('P3', 'living_in_brasil')('P5', 'living_state')('P6', 'born_or_graduated')('P8', 'degreee_level')('P10', 'job_situation')('P12', 'workers_number')('P13', 'manager')('P16', 'salary_range')('P17', 'time_experience_data_science')('P18', 'time_experience_before')('P19', 'is_data_science_professional')('P20', 'linear_regression')('P20', 'logistic_regression')('P20', 'glms')('P20', 'decision_tree')('P20', 'random_forest')('P20', 'neural_networks')('P20', 'bayesian_inference')('P20', 'ensemble')('P20', 'svms')('P20', 'cnns')('P20', 'rnns')('P20', 'hmms')('P20', 'gans')('P20', 'markov_chains')('P20', 'nlp')('P20', 'gradient_boosted_machines')('P20', 'cluster_analysis')('P20', 'survival_analysis')('P20', 'longitudinal_data_analysis')('P20', 'joint analysis')('P20', 'no_listed_methods')('P21', 'sql_')('P21', 'r')('P21', 'python')('P21', 'c_c++_c#')('P21', 'dotnet')('P21', 'java')('P21', 'julia')('P21', 'sas_stata')('P21', 'visual_basic_vba')('P21', 'scala')('P21', 'matlab')('P21', 'php')('P21', 'no_listed_languages')('P22', 'most_used_proggraming_languages')('P23', 'sql')('P23', 'nosql')('P23', 'images')('P23', 'nlp')('P23', 'videos')('P23', 'sheets')('P23', 'other')('P24', 'sql')('P24', 'nosql')('P24', 'imagens')('P24', 'nlp')('P24', 'vídeos')('P24', 'planilhas')('P24', 'other')('P25', 'aws')('P25', 'gcp')('P25', 'azure')('P25', 'ibm')('P25', 'on_premise_servers')('P25', 'cloud_própria')('P25', 'other')('P26', 'mysql')('P26', 'oracle')('P26', 'sql_server')('P26', 'aurora')('P26', 'dynamodb')('P26', 'coachdb')('P26', 'cassandra')('P26', 'mongodb')('P26', 'mariadb')('P26', 'datomic')('P26', 's3')('P26', 'postgresql')('P26', 'elaticsearch')('P26', 'db2')('P26', 'ms_access')('P26', 'sqlite')('P26', 'sybase')('P26', 'firebase')('P26', 'vertica')('P26', 'redis')('P26', 'neo4j')('P26', 'google_bigtable')('P26', 'hbase')('P26', 'other')('P27', 'microsoft_powerbi')('P27', 'qlik_view_qlik_sense')('P27', 'tableau')('P27', 'metabase')('P27', 'superset')('P27', 'redash')('P27', 'microstrategy')('P27', 'ibm_analytics_cognos')('P27', 'sap_business_objects')('P27', 'oracle_business_intelligence')('P27', 'birst')('P27', 'looker')('P27', 'google_data_studio')('P27', 'only_excel_gsheets')('P27', 'no_bi_tool_at_work')('P27', 'other')('P28', 'sql_&_stored_procedures')('P28', 'apache_airflow')('P28', 'luigi')('P28', 'aws_glue')('P28', 'talend')('P28', 'pentaho')('P28', 'alteryx')('P28', 'oracle_data_integrator')('P28', 'ibm_data_stage')('P28', 'sap_bw_etl')('P28', 'siss_sql_server_integration_services')('P28', 'other')('P29', 'have_data_warehouse')('P30', 'google_bigquery')('P30', 'aws_redshift')('P30', 'snowflake')('P30', 'oracle')('P30', 'postgres_mysql')('P30', 'ibm')('P30', 'teradata')('P30', 'microsoft_azure')('P30', 'do_not_know')('P30', 'other')('P31', 'data_hackers_blog')('P31', 'data_hackers_podcast')('P31', 'weekly_newsletter')('P31', 'slack_channel')('P31', 'data_hackers_bootcamp')('P31', 'do_not_know_data_hackers')('P32', 'prefered_data_hackers_initiative')('P33', 'telegram_groups')('P33', 'whatsapp_groups')('P33', 'youtube_channels')('P33', 'other_brasilian_blogs')('P33', 'other_slack_channels')('P33', 'twitter')('P33', 'abroad_blogs')('P33', 'abroad_podcasts')('P33', 'meetups_and_events')('P33', 'only_data_hackers')('P33', 'other')('P34', 'udacity')('P34', 'coursera')('P34', 'udemy')('P34', 'height')('P34', 'edx')('P34', 'data_camp')('P34', 'data_quest')('P34', 'kaggle_learn')('P34', 'online_courses')('P34', 'other')('P35', 'data_science_plataforms_preference')('P35', 'other')('P36', 'draw_participation')('D1', 'living_macroregion')('D2', 'origin_macroregion')('D3', 'anonymized_degree_area')('D4', 'anonymized_market_sector')('D5', 'anonymized_manager_level')('D6', 'anonymized_role')profissaoidadesalariotamanho_da_empresagestorse_considera_dssexoexperiencia_dstipo_de_trabalhoescolaridadearea_de_formacaosetor_de_mercadoplataforma_favorita
0v9otv8j9wdvjrv9otvwnn9owhzq54ktv37.0Masculino1Minas Gerais (MG)1.0Estudante de GraduaçãoEmpregado (CTL)de 1 a 50.0de R$ 1.001/mês a R$ 2.000/mêsNão tenho experiência na área de dadosNão tive experiência na área de TI/Engenharia de Software antes de começar a trabalhar na área de dados00000000000000000000000000000000000NaN0000000000000000000000000000000000000000000000000000000000000000000000000NaN0000000000000001Ainda não conhecia o Data Hackers001000000000000000010Nunca fiz cursos onlineNaN1.0Região SudesteNaNComputação / Engenharia de Software / Sistemas de InformaçãoOutrasNaNOutrasOutras[31,40]1500.0PequenanãonãoMasculinoNão tenho experiência na área de dadosEmpregado (CTL)Estudante de GraduaçãoComputação / Engenharia de Software / Sistemas de InformaçãoOutrasNunca fiz cursos online
1875ul998t0hqcv0871uptwf3oswcfv3524.0Feminino1São Paulo (SP)1.0Estudante de GraduaçãoEmpregado (CTL)Acima de 30000.0de R$ 2.001/mês a R$ 3000/mêsMenos de 1 anoNão tive experiência na área de TI/Engenharia de Software antes de começar a trabalhar na área de dados10000000000000000000011010000000000Python10000001000000100000001000000001000000000000010000000000000001000000000000.00000000000000001Ainda não conhecia o Data Hackers001000000000010000001NaNData Science Academy0.0Região SudesteNaNComputação / Engenharia de Software / Sistemas de InformaçãoEducaçãoNaNData Analyst/Analista de DadosNaN[18,24]2500.0GrandenãosimFemininoMenos de 1 anoEmpregado (CTL)Estudante de GraduaçãoComputação / Engenharia de Software / Sistemas de InformaçãoEducaçãoNaN
2puscuk079vw1pusbb900pzw2xvpxtgdk26.0Masculino1São Paulo (SP)1.0Graduação/BachareladoEmpregado (CTL)Acima de 30000.0de R$ 4.001/mês a R$ 6.000/mêsde 1 a 2 anosde 6 a 10 anos10000110110000000100000010000000000Python00000100000010000001000000000000000000000000100100000000000000000000000010.00000000000011000Newsletter Semanal000100000000110000100UdemyNaN1.0Região SudesteNaNOutras EngenhariasIndústria (Manufatura)NaNOutrasOutras[25,30]5000.0GrandenãosimMasculinode 1 a 2 anosEmpregado (CTL)Graduação/BachareladoOutras EngenhariasIndústria (Manufatura)Udemy
3rmel8ewqpbffp2mnfbzermel8eqincov21.0Masculino1São Paulo (SP)0.0Estudante de GraduaçãoEstagiáriode 11 a 500.0de R$ 1.001/mês a R$ 2.000/mêsMenos de 1 anode 2 a 3 anos10000100000000000000001010000000000SQL10000001000000100000000000000000000000000000100000000000000010000010000001.00100000000000001Ainda não conhecia o Data Hackers010000010000111000000AluraNaN1.0Região SudesteRegião SudesteComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareNaNBusiness Intelligence/Analista de BIAnalista de BI[18,24]1500.0PequenanãosimMasculinoMenos de 1 anoEstagiárioEstudante de GraduaçãoComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareAlura
4pj9mgud4d6mdct1l7vq0pj9mgu78h6ju27.0Masculino1NaN1.0Graduação/BachareladoFreelancerde 6 a 101.0de R$ 6.001/mês a R$ 8.000/mêsde 4 a 5 anosde 4 a 5 anos11001100110100010000001011000000000Python11010100001000000000110000001100110010000000000000000000000100000000000010.00000000000100000Blog/Medium do Data Hackers000000000011110000000UdemyNaN1.0NaNNaNComputação / Engenharia de Software / Sistemas de InformaçãoInternet/EcommerceC-level (CDO, CIO, CTO)NaNNaN[25,30]7000.0PequenasimsimMasculinode 4 a 5 anosFreelancerGraduação/BachareladoComputação / Engenharia de Software / Sistemas de InformaçãoInternet/EcommerceUdemy
5cb7n2v7372y97wl1lcb7n2e8tpockl8327.0Masculino1Paraná (PR)1.0Estudante de GraduaçãoEmpregado (CTL)de 101 a 5000.0de R$ 3.001/mês a R$ 4.000/mêsMenos de 1 anode 4 a 5 anos00000000000000000000000000000000000NaN0000000000000000000000000000000000000000000000000000000000000000000000000NaN0000000000111000Podcast do Data Hackers001000000100010000000UdemyNaN1.0Região SulNaNComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareNaNOutrasOutras[25,30]3500.0MédianãonãoMasculinoMenos de 1 anoEmpregado (CTL)Estudante de GraduaçãoComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareUdemy
6ayev5viaqe43pxxqqxayev5vtvdba61934.0Masculino1Rio Grande do Sul (RS)1.0Pós-graduaçãoEmpregado (CTL)de 51 a 1001.0de R$ 6.001/mês a R$ 8.000/mêsNão tenho experiência na área de dadosMenos de 1 ano10000000000000000000011010010000010Java10000001000000000010001000000000100000000000000000000000000101000000000000.00000000000111000Podcast do Data Hackers101000000000010000001UdemyNaN1.0Região SulNaNComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareTeam Leader/Tech LeaderNaNNaN[31,40]7000.0NaNsimsimMasculinoNão tenho experiência na área de dadosEmpregado (CTL)Pós-graduaçãoComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareUdemy
7se3czgy682ew760hzvhmvsse3czpwozo37.0Feminino1São Paulo (SP)0.0Graduação/BachareladoFreelancerde 11 a 501.0de R$ 8.001/mês a R$ 12.000/mêsde 2 a 3 anosNão tive experiência na área de TI/Engenharia de Software antes de começar a trabalhar na área de dados00000000000000000000000000000000000NaN0000000000000000000000000000000000000000000000000000000000000000000000000NaN0000000000000001Bootcamp do Data Hackers010000001000010000001NaNDigital House1.0Região SudesteRegião NordesteOutras EngenhariasOutrasCoordenadorNaNNaN[31,40]10000.0PequenasimnãoFemininode 2 a 3 anosFreelancerGraduação/BachareladoOutras EngenhariasOutrasNaN
8h5zvhct1kbmbb49h5zgzb3c1ce6lnrl626.0Masculino1NaN1.0Estudante de GraduaçãoEstagiáriode 6 a 100.0de R$ 2.001/mês a R$ 3000/mêsNão tenho experiência na área de dadosde 2 a 3 anos00000000000000000000000000000000000NaN0000000000000000000000000000000000000000000000000000000000000000000000000NaN0000000000100000Blog/Medium do Data Hackers100101100000001000000UdemyNaN0.0NaNNaNComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareNaNDesenvolvedor ou Engenheiro de SoftwareDesenvolvedor ou Engenheiro de Software[25,30]2500.0PequenanãonãoMasculinoNão tenho experiência na área de dadosEstagiárioEstudante de GraduaçãoComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareUdemy
94t388yqrekd1gsq4t388b9gqkmt2z86x28.0Masculino1São Paulo (SP)1.0MestradoEmpregado (CTL)Acima de 30000.0de R$ 8.001/mês a R$ 12.000/mêsde 2 a 3 anosde 6 a 10 anos11101001000000000000000110000000000R10000100000010100000000000000000000000000000100000000000000100010000000000.00000000000000001Ainda não conhecia o Data Hackers010000000000010000000UdemyNaN1.0Região SudesteNaNComputação / Engenharia de Software / Sistemas de InformaçãoÁrea da SaúdeNaNData Scientist/Cientista de DadosCientista de Dados[25,30]10000.0GrandenãosimMasculinode 2 a 3 anosEmpregado (CTL)MestradoComputação / Engenharia de Software / Sistemas de InformaçãoÁrea da SaúdeUdemy

Last rows

('P0', 'id')('P1', 'age')('P2', 'gender')('P3', 'living_in_brasil')('P5', 'living_state')('P6', 'born_or_graduated')('P8', 'degreee_level')('P10', 'job_situation')('P12', 'workers_number')('P13', 'manager')('P16', 'salary_range')('P17', 'time_experience_data_science')('P18', 'time_experience_before')('P19', 'is_data_science_professional')('P20', 'linear_regression')('P20', 'logistic_regression')('P20', 'glms')('P20', 'decision_tree')('P20', 'random_forest')('P20', 'neural_networks')('P20', 'bayesian_inference')('P20', 'ensemble')('P20', 'svms')('P20', 'cnns')('P20', 'rnns')('P20', 'hmms')('P20', 'gans')('P20', 'markov_chains')('P20', 'nlp')('P20', 'gradient_boosted_machines')('P20', 'cluster_analysis')('P20', 'survival_analysis')('P20', 'longitudinal_data_analysis')('P20', 'joint analysis')('P20', 'no_listed_methods')('P21', 'sql_')('P21', 'r')('P21', 'python')('P21', 'c_c++_c#')('P21', 'dotnet')('P21', 'java')('P21', 'julia')('P21', 'sas_stata')('P21', 'visual_basic_vba')('P21', 'scala')('P21', 'matlab')('P21', 'php')('P21', 'no_listed_languages')('P22', 'most_used_proggraming_languages')('P23', 'sql')('P23', 'nosql')('P23', 'images')('P23', 'nlp')('P23', 'videos')('P23', 'sheets')('P23', 'other')('P24', 'sql')('P24', 'nosql')('P24', 'imagens')('P24', 'nlp')('P24', 'vídeos')('P24', 'planilhas')('P24', 'other')('P25', 'aws')('P25', 'gcp')('P25', 'azure')('P25', 'ibm')('P25', 'on_premise_servers')('P25', 'cloud_própria')('P25', 'other')('P26', 'mysql')('P26', 'oracle')('P26', 'sql_server')('P26', 'aurora')('P26', 'dynamodb')('P26', 'coachdb')('P26', 'cassandra')('P26', 'mongodb')('P26', 'mariadb')('P26', 'datomic')('P26', 's3')('P26', 'postgresql')('P26', 'elaticsearch')('P26', 'db2')('P26', 'ms_access')('P26', 'sqlite')('P26', 'sybase')('P26', 'firebase')('P26', 'vertica')('P26', 'redis')('P26', 'neo4j')('P26', 'google_bigtable')('P26', 'hbase')('P26', 'other')('P27', 'microsoft_powerbi')('P27', 'qlik_view_qlik_sense')('P27', 'tableau')('P27', 'metabase')('P27', 'superset')('P27', 'redash')('P27', 'microstrategy')('P27', 'ibm_analytics_cognos')('P27', 'sap_business_objects')('P27', 'oracle_business_intelligence')('P27', 'birst')('P27', 'looker')('P27', 'google_data_studio')('P27', 'only_excel_gsheets')('P27', 'no_bi_tool_at_work')('P27', 'other')('P28', 'sql_&_stored_procedures')('P28', 'apache_airflow')('P28', 'luigi')('P28', 'aws_glue')('P28', 'talend')('P28', 'pentaho')('P28', 'alteryx')('P28', 'oracle_data_integrator')('P28', 'ibm_data_stage')('P28', 'sap_bw_etl')('P28', 'siss_sql_server_integration_services')('P28', 'other')('P29', 'have_data_warehouse')('P30', 'google_bigquery')('P30', 'aws_redshift')('P30', 'snowflake')('P30', 'oracle')('P30', 'postgres_mysql')('P30', 'ibm')('P30', 'teradata')('P30', 'microsoft_azure')('P30', 'do_not_know')('P30', 'other')('P31', 'data_hackers_blog')('P31', 'data_hackers_podcast')('P31', 'weekly_newsletter')('P31', 'slack_channel')('P31', 'data_hackers_bootcamp')('P31', 'do_not_know_data_hackers')('P32', 'prefered_data_hackers_initiative')('P33', 'telegram_groups')('P33', 'whatsapp_groups')('P33', 'youtube_channels')('P33', 'other_brasilian_blogs')('P33', 'other_slack_channels')('P33', 'twitter')('P33', 'abroad_blogs')('P33', 'abroad_podcasts')('P33', 'meetups_and_events')('P33', 'only_data_hackers')('P33', 'other')('P34', 'udacity')('P34', 'coursera')('P34', 'udemy')('P34', 'height')('P34', 'edx')('P34', 'data_camp')('P34', 'data_quest')('P34', 'kaggle_learn')('P34', 'online_courses')('P34', 'other')('P35', 'data_science_plataforms_preference')('P35', 'other')('P36', 'draw_participation')('D1', 'living_macroregion')('D2', 'origin_macroregion')('D3', 'anonymized_degree_area')('D4', 'anonymized_market_sector')('D5', 'anonymized_manager_level')('D6', 'anonymized_role')profissaoidadesalariotamanho_da_empresagestorse_considera_dssexoexperiencia_dstipo_de_trabalhoescolaridadearea_de_formacaosetor_de_mercadoplataforma_favorita
17553z4afj8ziervly3z4ar7tbmydadwdb9n28.0Masculino1Paraná (PR)0.0Pós-graduaçãoEmpregado (CTL)de 11 a 500.0de R$ 3.001/mês a R$ 4.000/mêsMenos de 1 anode 6 a 10 anos10100101000000010000101010010000100SQL11110101000010000001010000000000100000000000000010000000000000000010000000.00000000000010000Podcast do Data Hackers001000001000011000000UdemyNaN1.0Região SulRegião SudesteComputação / Engenharia de Software / Sistemas de InformaçãoTelecomunicaçãoNaNData Analyst/Analista de DadosNaN[25,30]3500.0PequenanãosimMasculinoMenos de 1 anoEmpregado (CTL)Pós-graduaçãoComputação / Engenharia de Software / Sistemas de InformaçãoTelecomunicaçãoUdemy
1756vkta5v2o695imgyd3779rovkta5v2r4p18.0Masculino1Minas Gerais (MG)1.0Graduação/BachareladoEstagiáriode 501 a 10000.0de R$ 1.001/mês a R$ 2.000/mêsMenos de 1 anode 2 a 3 anos10000100110000100000001110000000000SQL11100001000000110000011100001100000000000000010010000000000000000000000011.00100000000111100Blog/Medium do Data Hackers010000000011110010000UdemyNaN1.0Região SudesteNaNComputação / Engenharia de Software / Sistemas de InformaçãoInternet/EcommerceNaNData Analyst/Analista de DadosNaN[18,24]1500.0MédianãosimMasculinoMenos de 1 anoEstagiárioGraduação/BachareladoComputação / Engenharia de Software / Sistemas de InformaçãoInternet/EcommerceUdemy
17575rg9lph8b5a68di0fk5rg9lphjvlg1vv31.0Masculino1São Paulo (SP)1.0Graduação/BachareladoEmpregado (CTL)de 501 a 10000.0de R$ 6.001/mês a R$ 8.000/mêsde 6 a 10 anosNão tive experiência na área de TI/Engenharia de Software antes de começar a trabalhar na área de dados00000000000000000000000000000000000NaN0000000000000000000000000000000000000000000000000000000000000000000000000NaN0000000000010000Canal do Slack000100100000000000010CourseraNaN1.0Região SudesteNaNComputação / Engenharia de Software / Sistemas de InformaçãoÁrea da SaúdeNaNDesenvolvedor ou Engenheiro de SoftwareDesenvolvedor ou Engenheiro de Software[31,40]7000.0MédianãonãoMasculinode 6 a 10 anosEmpregado (CTL)Graduação/BachareladoComputação / Engenharia de Software / Sistemas de InformaçãoÁrea da SaúdeCoursera
17588gz3i9gwg3uckdqx4r8gz3i9bebm0ktd35.0Masculino1Rio de Janeiro (RJ)1.0Graduação/BachareladoEmpreendedor ou Empregado (CNPJ)Acima de 30001.0de R$ 8.001/mês a R$ 12.000/mêsNão tenho experiência na área de dadosMais de 10 anos00000000000000000000000000000000000NaN0000000000000000000000000000000000000000000000000000000000000000000000000NaN0000000000101100Canal do Slack000100000101111000000UdemyNaN0.0Região SudesteNaNComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareCoordenadorDesenvolvedor ou Engenheiro de SoftwareDesenvolvedor ou Engenheiro de Software[31,40]10000.0GrandesimnãoMasculinoNão tenho experiência na área de dadosEmpreendedor ou Empregado (CNPJ)Graduação/BachareladoComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareUdemy
17595p99xpvpnh03ebva5p99uezed5g5zcb743.0Masculino1São Paulo (SP)1.0Graduação/BachareladoEmpregado (CTL)de 51 a 1000.0de R$ 8.001/mês a R$ 12.000/mêsMais de 10 anosMais de 10 anos00000000000000000000000000000000000NaN0000000000000000000000000000000000000000000000000000000000000000000000000NaN0000000000110000Podcast do Data Hackers000100000000010000000UdemyNaN1.0Região SudesteNaNComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareNaNDesenvolvedor ou Engenheiro de SoftwareDesenvolvedor ou Engenheiro de Software[41,50]10000.0NaNnãonãoMasculinoMais de 10 anosEmpregado (CTL)Graduação/BachareladoComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareUdemy
1760qkjyuc8dayrrqahyjqkj841kpgymnpau23.0Feminino1Minas Gerais (MG)1.0Graduação/BachareladoDesempregado, buscando recolocaçãoNaNNaNNaNNão tenho experiência na área de dadosNão tive experiência na área de TI/Engenharia de Software antes de começar a trabalhar na área de dados00000000000000000000000000000000000NaN0000000000000000000000000000000000000000000000000000000000000000000000000NaN0000000000111000Newsletter Semanal000100110000010000100UdemyNaN0.0Região SudesteNaNOutrasNaNNaNNaNNaN[18,24]NaNNaNNaNnãoFemininoNão tenho experiência na área de dadosDesempregado, buscando recolocaçãoGraduação/BachareladoOutrasNaNUdemy
17617z5vvw7ycrxn4sujnv7z5vvw7r36gi5k39.0Masculino1São Paulo (SP)1.0Estudante de GraduaçãoEmpregado (CTL)de 1001 a 30000.0de R$ 4.001/mês a R$ 6.000/mêsde 2 a 3 anosNão tive experiência na área de TI/Engenharia de Software antes de começar a trabalhar na área de dados00000000000000000000000000000000000NaN0000000000000000000000000000000000000000000000000000000000000000000000000NaN0000000000110100Canal do Slack111000001000010000000UdemyNaN1.0Região SudesteNaNComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareNaNBusiness Analyst/Analista de NegóciosAnalista de BI[31,40]5000.0GrandenãonãoMasculinode 2 a 3 anosEmpregado (CTL)Estudante de GraduaçãoComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareUdemy
17620s2jaountv3tyrae0sir4jd8o5hz19ob36.0Masculino1Paraná (PR)0.0Pós-graduaçãoEmpregado (CTL)Acima de 30000.0de R$ 6.001/mês a R$ 8.000/mêsde 2 a 3 anosMais de 10 anos00000000000000000000000000000000000NaN0000000000000000000000000000000000000000000000000000000000000000000000000NaN0000000000110000Podcast do Data Hackers001100100000011000000UdemyNaN1.0Região SulRegião NordesteComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareNaNData Engineer/Engenheiro de DadosNaN[31,40]7000.0GrandenãonãoMasculinode 2 a 3 anosEmpregado (CTL)Pós-graduaçãoComputação / Engenharia de Software / Sistemas de InformaçãoTecnologia/Fábrica de SoftwareUdemy
1763r3kvh535z7xai2v8er3kv791tkmcysz829.0Masculino1Minas Gerais (MG)1.0Não tenho graduação formalEmpreendedor ou Empregado (CNPJ)de 11 a 501.0de R$ 8.001/mês a R$ 12.000/mêsde 4 a 5 anosMais de 10 anos11100001000000000000001010000000000SQL11010101000000100000010010000001110000000000010010000000000000100000000011.00100000000111110Canal do Slack001100010001011100000UdacityNaN0.0Região SudesteNaNNaNTecnologia/Fábrica de SoftwareC-level (CDO, CIO, CTO)Data Engineer/Engenheiro de DadosNaN[25,30]10000.0PequenasimsimMasculinode 4 a 5 anosEmpreendedor ou Empregado (CNPJ)Não tenho graduação formalNaNTecnologia/Fábrica de SoftwareUdacity
17641cz0hm5i92rw913f1cz0hmvferfavkkb22.0NaN0NaNNaNNão tenho graduação formalPrefiro não dizerAcima de 30000.0Menos de R$ 1.000/mêsMenos de 1 anoNão tive experiência na área de TI/Engenharia de Software antes de começar a trabalhar na área de dados00000000000000000000000000000000000NaN0000000000000000000000000000000000000000000000000000000000000000000000000NaN0000000000111000Blog/Medium do Data Hackers111000101001110000000UdemyNaN0.0NaNNaNNaNSetor AlimentícioNaNEstatísticoCientista de Dados[18,24]1000.0GrandenãonãoNaNMenos de 1 anoPrefiro não dizerNão tenho graduação formalNaNSetor AlimentícioUdemy